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© Fault tolerant storage controller utilizing tightly coupled dual controller modules. 
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© A fault tolerant storage controller utilizing tightly coupled dual controller modules. The controller modules 
each check to see if another controller module or cache module is present and, if so, then all configuration 
information with respect to the controller modules and attached devices are shared between them. Configuration 
information may be entered into either or both of the controller modules and the information is shared 
dynamically. Each cache module may be "locked" by an individual controller module to prevent the other 
controller module from inadvertently disturbing the contents of the other controller module's cache. During 
initialization, each controller module checks for the existence of an associated cache module and, if present, it is 
immediately "locked" by the controller module. Should a controller module fail or give an indication of a 
malfunction, the other controller module will disable or "kill" the malfunctioning controller module thereby 
resetting it and releasing any lock it may have had on its cache module. In those instances where the cache 
module is a write cache, the surviving controller module can resume operations where the malfunctioning 
controller module left off and complete any remaining writes to the disabled controller module's storage devices 
preventing the loss of any host computer data. The controller modules are tolerant of the other controller module 
failing and then rebooting and the sequence of events is detected and recognized by the surviving controller 
module such that it does not disable the one that failed. The dual controller modules communicate asyn- 
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chronously to verify that they are each operational and to exchange and verify configuration information and to 
provide operational status dynamically. 
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BACKGROUND OF THE INVENTION: 

The present invention relates, in general, to the field of storage controllers for interfacing one or more 
host computers to one or more computer mass storage devices such as disk drives. More particularly, the 
5 present invention relates to a fault tolerant storage controller utilizing tightly coupled dual controller modules 
which provides essentially fail-safe controller initialization and fail-over in the event that one of the controller 
modules malfunctions. 

Storage controllers are commonly utilized to off-load certain lower level processing functions relating to 
data storage and retrieval from a host computer and to serve as an interface between the host and the 
10 physical storage media, which in most normal operations, may be a number of disk drives coupled to one 
or more storage buses. The controller is, therefore, one of the most critical elements in the system 
input/output ("I/O") performance chain. 

As a consequence, it is critical for continued system operation that the host computer(s) be able to 
continue to reliably read and write data from and to the storage media without interruption. Thus, to the 
75 extent possible, the storage controller must be fault tolerant. Moreover, balancing factors such as cost, 
performance and reliability, it is also important that the storage controller be user configurable, for example, 
as a redundant array of inexpensive disks ("RAID") controller. 

SUMMARY OF THE INVENTION 

20 

Disclosed herein is a fault tolerant storage controller utilizing tightly coupled dual controller modules. 
The controller modules of the fault tolerant storage controller check to see if another controller module, 
cache module or other auxiliary module is present electrically coupled and, if so, all configuration 
information with respect to the controller modules and their attached devices are shared between them. 

25 Configuration information, which may be entered into either or both of the controller modules, is shared 
dynamically. Each cache module of the fault tolerant storage controller may be "locked" by an individual 
controller module to prevent the other controller module from inadvertently disturbing the contents of the 
opposite controller module's cache. 

During initialization, each controller module checks for the existence of an associated cache module 

30 and, if present, it is immediately locked by that controller module. Should an individual controller module fail 
or give an indication of a malfunction, the other controller module will disable or "kill" the malfunctioning 
controller module, thereby resetting and releasing any lock it may have had on its cache module. In those 
instances where the cache module is a write cache, the surviving controller module can resume operations 
where the malfunctioning controller module left off and complete any remaining writes to the disabled 

35 controller module's storage devices preventing the loss of any host computer data. The controller modules 
are also tolerant of the other controller module failing and then rebooting and the sequence of events is 
detected and recognized by the surviving controller module such that it does not disable the one that failed. 
The dual controller modules of the fault tolerant storage controller of the present invention communicate 
asynchronously to verify that they are each operational and to exchange and verify configuration information 

40 and any state changed with respect to attached devices dynamically. This invention in its broad form 
resides in a computer mass storage device controller system as recited in claim 1. The invention also 
resides in a fault tolerant computer mass storage device controller as recited in claim 4. 

Specifically provided herein is a computer mass storage device controller system which comprises first 
and second controller modules operatively coupled to a plurality of computer mass storage devices. The 

45 first controller module controls a first subset of the plurality of computer mass storage devices and the 
second controller module controls a second subset. A first signal line communicates between the first and 
second controller modules to indicate the presence of the first controller module to the second controller 
module and the second controller module to the first controller module. A communications channel 
interconnects the first and second controller modules with the first and second controller modules being 

so responsive to a controller present signal on the first signal line to enable communications between the first 
and second controller modules over the communications channel. A second signal line couples the first and 
second controller modules with the second signal line enabling the first controller module to functionally 
disable the second controller module in response to an indication on the communications channel that the 
second controller module may have malfunctioned. The first controller module thereupon assumes control 

55 of the second subset of the plurality of computer mass storage devices from the second controller module. 

Additionally disclosed herein is a fault tolerant computer mass storage device controller which 
comprises first and second controller modules for operative coupling to a plurality of computer mass 
storage devices. The first controller module controls a first subset of the plurality of computer mass storage 

3 
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devices and the second controller module controls a second subset. A plurality of commun,cat.ons lines 
interconnect the first and second controller modules. The communication lines provide configuration and 
operational information between the first and second controller modules. The communication l.nes further 
include a controller module disable line for assertion by either of the controller modules to operative^ 
disable the other of the controller modules in response to a malfunction indication from the other of he 
controller module. In operation, the first controller module may operatively disable the second control er 
module by asserting the controller module disable line whereupon the first controller module add.tionally 
assumes control of the second subset of the plurality of computer mass storage devices. 

Still further disclosed herein is a computer mass storage device controller system which comprises first 
and second controller modules for operative coupling to a plurality of computer mass storage devices^ The 
first controller module controls a first subset of the plurality of computer mass storage devices and the 
second controller module controls a second subset thereof. A communications medium interconnects the 
first and second controller modules. The communications medium conveys a controller present indication 
from one of the controller modules to the other of the controller modules when one of the controller 
modules .s included in the system. The communication medium includes a plurality of controller module 
commutation l.nes for communications between the first and second controller modules when the 
controller present .nd.cat.on is detected by both the first and second controller modules. The communica- 
tion medium addit.onally enables one of the controller modules to functionally disable the other of the 
controller modules upon detection of a malfunction of the other of the controller modules. 

BRIEF DESCRIPTION OF THE DRAWINGS 

A more detailed understanding of the invention may be had from the following description of a preferred 
embodiment, given way of example and to be read in conjunction with the accompanying drawing wherein: 
Fig. 1 is a detailed logic block diagram of a storage controller module in accordance with the present 
invention; 

Fig 2 is an overview logic block diagram of a pair of tightly coupled storage controller modules as 
individually depicted in Fig. 1, illustrating each controller's respective cache module in a dual controller 
configuration and illustrating the inter-controller communication therebetween; 

Fig 3 is a simplified logic block diagram of the tightly coupled storage controller modules and respective 
cache modules of Fig. 2 further illustrating the coupling of the storage controller modules to a number of 

computer mass storage device buses; ,„__...„. 

Fig 4 is a functional logic block diagram of the dynamic random access memory ( DRAM ) controller 
and arbiter ("DRAB") integrated circuit chip of the storage controller and cache modules of Figs. 1 and 

Fig. 5 is a simplified logic diagram and register map of the FX chip of the storage controller module of 

Figs "to and 6b are illustrations of the write diagnostic register 0 and write diagnostic register 1 of the 
storage controller module of Fig. 1 ; and 

Figs. 7a and 7b are illustrations of the read diagnostic register 0 and read diagnostic register 1 of the 
storage controller module of Fig. 1. 

DESCRIPTION OF A PREFERRED EMBODIMENT 

With reference to Fig. 1, a controller module 10 for use in the storage controller of the present invention 
is shown. Storage controller module 10 comprises a host-interface 12 and device ports 14. The controller 
module 10 further includes a number of data buses including IBUS data lines 16 and associated BUS 
address lines 18 (collectively sometimes hereinafter referred to as "IBUS 19"). The designation IBUS 19 
refers to the fact that this is the primary bus for interfacing with processor 28 of the controller module 10, 
the processor 28 in the embodiment shown being preferably an Intel 80960CA. 

Similarly, the controller module 10 also includes NBUS data lines 20 and NBUS address l.nes 22 
(hereinafter sometimes collectively referred to as "NBUS 21"). The designation NBUS 21 is utilized as a 
designation in light of the fact that the SCSI port processors 32 0 -32 n and SCSI-FWD port processor 34 may 
be furnished as NCR53C710 and NCR53C720 SCSI port processors, respectively. 

MBUS 24 (hereinafter sometimes referred to as memory data address lines "MDAL") is the primary bus 
associated with the controller module 10 DRAM buffer 30 memory. Additionally, the controller module 10 
further includes a cache data address line ("CDAL") bus 26 primarily associated with the assoc.ated cache 
modules which will be more fully described hereinafter. As shown. IBUS 19 and CDAL bus 26 have wrth 
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them an associated control bus 88 coupling DRAM controller and arbiter ("DRAB") 38 to any associated 
cache module (not shown) through backplane connector 36 as well as to a number of devices coupled to 
IBUS 19, including processor 28. 

DRAB 38 provides an output on DRAM address line ("DADDR") lines 94 to the individual D RAM's 98 of 

5 the DRAM buffer 30. Similarly, DRAB 38 provides a control line input to the same D RAM's 98. Error 
correction code ("ECC") lines 92 input is supplied to DRAB 38 from DRAM buffer 30. An additional bi- 
directional NBUS control line 90 interconnects DRAB 38 with devices coupled to NBUS 21 as will be more 
fully described hereinafter. 

Devices coupled to processor 28 by means of IBUS 19 and controlled by signals on control bus 88 

io include journal RAM 46, timer 50, dual universal asynchronous receiver transmitter ("DUART") 48 and code 
card 40. Additionally coupled to IBUS 19 and control bus 88 is l/D cache 44 which supplies a Match A 
output 72 and Match B output 74 to tag RAM's 70, comprising 4K longwords RAM 76 and 78 as shown. 

Additionally, read diagnostic registers (0) 80 and (1) 82 as well write diagnostic registers (0) 84 and (1) 
86 are also coupled to IBUS 19 as well as control bus 88. 

is MBUS 24 is coupled to CDAL bus 26 by means of a bus exchanger 100 comprising a pair of cross- 

coupled 4-way cross point switches 52. Host-interface 12, which comprises a host bus specific DMA engine 
56, associated interface 58 and connector 60 is coupled to NBUS 21 as well as NBUS control lines 90 and 
NBUS data lines 20 by means of transceiver ("XCVR") 66 as shown. XCVR 68 couples FX chip 54 (which 
will be more fully described hereinafter) to NBUS address lines 22. FX chip 54 (sometimes hereinafter 

20 referred to as "FX RAID Chip") provides bi-directional coupling to XOR buffer ("XBUF") 102 through XB 
ADDR and XD DATA data lines. XBUF 102 is also coupled to NBUS 21. 

Port processors 32 Q -32 n and 34 are coupled to a number of storage device busses 62 0 -62 n and host 
bus 64, respectively, through a number of connectors 65 as shown. Port processor 34 is a DMA engine for 
use with controller module 10 as a SCSI host similar to DMA engine 56 of host-interface 12. 

25 The controller module 10 hardware consists of a core set of functionality coupled with different host 
interfaces 12 and one or more device ports 14 depending on specific implementations. As required 
therefore, host interface 12 may comprise DMA engines 56 and associated interfaces 58 which would allow 
controller module 10 to interface to the Computer Interconnect ("CI") and Digital Storage System 
Interconnect ("DSSI") busses available from Digital Equipment Corp., assignee of the present invention, as 

30 well as any variation of the small computer system interconnect ("SCSI") regardless of whether fast or wide 
or differential or FWD SCSI, and fiber distributed data interconnect ("FDDI") host busses. In the embodi- 
ment shown in Fig. 1, device ports 14 support the SCSI device busses in all its current implementations. 
Controller module 10 provides 1 through as many as 6 device ports on device busses 62 Q - 62 N 
incorporated within device ports 14. 

35 The controller module 10 contains 4 internal busses as previously described and 2 to 7 external busses 
depending on its configuration. The internal busses are the IBUS 19, the MBUS 24, the NBUS 21 and the 
CDAL bus 26. Of these four internal busses, the IBUS 19 is the native bus used by the policy processor 28. 
The MBUS 24 is the bus used to access the shared buffer memory of DRAM buffer 30 for the controller 
module 10. The NBUS 21 is the bus used for the port processors 32, 34. The IBUS 19, NBUS 21 and 

40 MBUS 24 are tied together via bus exchanger 100 and the entities on the busses communicate in one of 
two ways. They may communicate directly via the bus exchanger 100 and only the processor 28 uses this 
technique for writing or reading registers in entities on the NBUS 21. They may also communicate indirectly 
via the shared buffer memory of DRAM buffer 30 or the cache modules as will be more fully described 
hereinafter. This is achieved by either the processor 28 or the NBUS 21 entities writing or reading buffer 

45 memory or cache. 

The CDAL bus 26 is the bus used to access the external cache modules (as will be more fully 
described hereinafter), from the main controller module 10. It may be accessed, if present, via the bus 
exchanger 100 as shown in Fig. 1. 

The common aspects of the IBUS 19, NBUS 21, MBUS 24 and CDAL bus 26 are that they are 32-bit 
so synchronous, with LONGWORD, word, and byte access, burst mode capability, 4 LONGWORD's maximum, 
and byte parity. The CDAL bus 26 and the MBUS bus 24 have multiplexed address and data while the 
IBUS 19 and NBUS 21 have separate address and data buses. 

In the embodiment shown in Fig. 1, controller module 10 has a core consisting of an Intel 80960CA 
processor chip utilized as processor 28. The processor 28 may be a 25 MHz version that provides a 
55 consistent 25 millions of instructions per second ("MIPS") of processing power for the controller module 10. 
Processor 28 is utilized as the policy software processor controlling all but the low level device and host 
port operations. Code card 40 of the controller module 10 comprises a memory card control store which 
incorporates a PCMCIA standard program card with one megabyte of memory. Code card 40 holds the 

5 
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code for the controller module 10 and is used each time the module boots. The code card 40 contents are 
validated upon any boot and copied to the shared memory of DRAM buffer 30 for the controller module 10. 

The shared memory DRAM of DRAM buffer 30 is the portion of the controller module which also 
comprises the DRAB 38 gate array. The shared memory is shared between the IBUS 19 and the NBUS 21 
entities for data structures and/or data buffers. A portion of the shared memory is used during normal 
operation as a read cache in the absence of a cache module such as a read or write-back cache module 
option as will be more fully described hereinafter. With either of these options, the portion of shared 
memory normally used for read cache will be used to hold cache module context for cache lookups. 

The l/D cache 44 is a 32 kbyte static random access memory ("SRAM") cache used by the processor 
28 to achieve faster access to instructions and variables during execution. This is required because of the 
shared memory between the IBUS 19 and NBUS 21. Although the processor 28 has an internal cache, it is 
not large enough to offset the performance degradation that the shared memory may cause. The shared 
memory subsystem, in conjunction with the l/D cache 44, represents a viable cost/performance compromise 
for controller module 10. Data coherency in the l/D cache 44 is maintained by using a "write-through" 
cache design. 

The journal RAM 46 is a non-volatile ("NV") 32 Kbyte battery backed up SRAM used for two different 
functions within controller module 10. First, journal RAM 46 is used to store the parameter information 
entered by the user and by the controller module 10 software. Second, the journal RAM 46 will also be 
used by RAID software in the controller module 10 to cover the RAID "write hole" created by power fails 
during RAID operations. This latter aspect is necessary in the event a controller module 10 is performing 
RAID operations in the absence of a write-back cache or dual controller module's 1 0 having dual journal 
RAM 46. The onboard journal RAM 46 need not be used in controller configurations where there are dual 
controller modules 10 and either of the write-back cache or dual controller module 10 journal RAM 46 
options are installed. However, if a read cache module alone is used, the on-board journal RAM 46 must be 
utilized in order to cover the RAID "write hole" previously described. 

The DUART 48 for the controller module 10 also serves two purposes. In the first instance, one of the 
integrated UART's is used for the controller module 10 console terminal. This is utilized to enter 
configurations, show parameters, control the controller module 10 operations, or run diagnostics and system 
utilities. The second purpose is to use the integrated UART to provide a link between two controller 
modules 10 in a tightly coupled dual controller module configuration via the controller module 10 backplane. 
The UART link is used to share configuration information and to insure that both controller modules 10 are 
running properly for failover purposes. The timer 50 is utilized to provide controller module 10 software with 
a relatively accurate timing interval. It should be noted that, should one (for example) port processor 32 fail 
on a particular controller module 10, the other controller module 10 will take over the devices coupled to 
that particular port. 

The bus exchanger 100, comprising a pair of four-way crosspoint switches 52 is a hardware block that 
allows all pieces of the controller module 10 to operate together. The bus exchanger 100 is basically a 2x2 
crossbar that is very fast and provides access by all controller module 10 functionality to all parts of the 
controller module 10 hardware. 

The FX chip 54, which will be more fully described hereinafter, is the hardware accelerator for RAID 
XOR operations as well as disk and tape compare operations. FX chip 54 allows for the controller module 
10 to read data into shared memory or cache, while also XORing into the FX chip 54. The FX chip 54 is 
also utilized for writing data from the shared memory to the device ports 14 while XORing into the FX chip 
54. Context is carried on a per device port basis, up to 6 ports. The FX chip 54 may also contain the logic 
to function as a time of year ("TOY") clock with an external oscillator and battery. The TOY may be 
accessible over the NBUS 21. 

As previously described, controller module 10 may incorporate a number of different host interfaces 12. 
In addition to FDDI, the controller module 10 host interface 12 may be implemented with the YACI and the 
CIRT chips available from Digital Equipment Corporation for the individual CI ports. The YACI chip is a DMA 
engine that buffers data into and out of the CI port via the shared memory within the controller module 10. 
Set up and maintenance of the CI port is carried out by policy software running in the processor 28. 

Host interface 12 may also comprise a DSSI interface implemented with the NCR53C720 chip as port 
processor 34 along with specific programmable array logic ("PAL") to interface between SCSI and DSSI. 
The NCR53C720 chip runs scripts that are read from shared memory to perform command and DMA 
operations on the DSSI interface. The policy software sets up and maintains this chip during controller 
module 10 operations. 

Still further, host interface 12 may comprise a SCSI fast-wide-differentiai ("FWD") interface imple- 
mented with the same NCR53C720 chip as with the DSSI interface but without the need for the PALs. The 

6 
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SCSI interface can be 16-bit differential or 8-bit differential depending on the host adaptor being used. 

The device ports 14 of controller module 10 in the embodiment shown are SCSI utilizing NCR53C710 
SCSI port processor chips. These chips support 8 bit operations up to FAST synchronous transfer mode 
(less than or equal to 10 megabytes/sec) in accordance with the SCSI specification. The port processors 
5 32 D execute scripts read from shared memory and are set up to run these scripts by the policy software. 
Each device port can have up to 6 devices (6 storage devices for single or dual controller module 10 
configurations). 

The four diagnostic registers of the controller module 10, comprising read diagnostics registers 80, 82 
and write diagnostic registers 84, 86 are incorporated within the controller module 10. The specific bits in 

70 the write diagnostic registers 84, 86 control portions of the controller module 10 or send signals from the 
controller module 10 across, the backplane to the optional cache modules as will be more fully described 
hereinafter. The processor 28 has 9 interrupt inputs. However, since there are more than 9 interrupt sources 
on the controller module 10, several interrupts are ORed together. In some cases, it is necessary for the 
processor 28 to read diagnostic registers 80, 82 to determine the specific cause of an interrupt. The write 

75 diagnostic registers 84, 86 are used by diagnostic and functional software in order to test or control the 
controller module 10 operation. Certain bits control test modes for diagnostics to force errors within the 
controller module 10. Other bits control light emitting diodes ("LED") for diagnostics as well as functional 
code. Some of the LED bits used by the functional code are used during "warm swap" of attached SCSI 
devices. A more detailed description of the read diagnostic registers 80, 82 and write diagnostic registers 

20 84, 86 will be given hereinafter. Interrupts for the controller module 10 are run directly into the processor 28 
eight maskable dedicated interrupt lines and single non-maskable ("NMI") interrupt. These interrupts cover 
errors, host and device port events, timing intervals, communication, and power events. 

By reference to the following table, the controller module 10 memory allocation is shown. 

25 

TaJole 1: Controller Modal* 10 Memory Map/Table 
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50 Referring additionally to Fig. 2, an overview logic block diagram of a pair of tightly coupled storage 
controller modules 10A and 10B as previously described with respect to Fig. 1 is shown. In this illustration, 
a first controller module is labeled 10A and a second corresponding controller module is labeled 10B. 
Controller module 10A comprises a DRAB (master) 38A and a bus exchanger 100A. In like manner, 
controller module 10B comprises a DRAB (master) 38B and bus exchanger 100B. 

55 A cache module 1 1 0A is associated directly with controller module 10A. Cache module 11 OA comprises 
a DRAB (slave) 114A corresponding to DRAB (master) 38A of controller module 10A. In like manner, a bus 
exchanger 112A also forms a portion of cache module 11 OA. Cache module 11 OB includes a DRAB (slave) 
114B and bus exchanger 11 2B. 
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35 E ar o :~du," caches independent,. The cache modu.es 

uTa and 110B have two ports per module, one for each controller module 10A and 10B. 
1 . CDAL bus 26 (32 multiplexed data and address plus 4 byte parity) lines. 
2 CDAL bus 26 Control lines (for memory access only). 

Cache module 110 to controller module 10 information 

1 Cache module 110A "In Place". To both controller modules 10A and OB. 

2 Cache module HOB "In Place". To both controller modules 10A and 10B. 

3 Cache module 110A "Lock". To controller module 10A. 

a cache n 0t curren t.y used by controller module 10A will have this interrupt masked off. 
55 fXZ ":^^^Z^S to disable the other controHer module 1 0B or 
O r!esp^ronc?i has determined that the other controller module 10 is no ^rj*-. 
Further details of this operation will be more fully described hereinafter with respect to the DRAB 38. 
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2. DUART 48 lines. This includes data and flow control lines. Send, Receive and two Clear to Send lines 
make up this four line interface. 

3. Controller module 10A "In Place". To controller module 10B. For Failover software. 

4. Controller module 10B "In Place". To controller module 10A. For Failover software. 
5 Backplane to controller module 10 and cache module 110 interface. 

1. Controller module 10A Slot 1. To controller module 10A. A low input denotes controller module 10A, or 
SCSI ID7. 

2. Controller module 10B Slot 2. To controller module 10B. A high input denotes controller module 10B, 
or SCSI ID 6. 

10 3. Cache module 11 OA Address Range. Low level. 

4. Cache module 11 OB Address Range. High level. 

CONTROLLER MODULE 10 HARDWARE INITIALIZATION 

75 Controller module 10 initialization can occur via a power cycle, a software reset, or by a user pressing 

the "RESET" button on the Operator Control Pane! (OCP) of the controller module 10 as will be more fully 
described hereinafter. Any initialization is delayed 20 milliseconds (power going down, software reset, or the 
"RESET", button) to allow for cache module 110 management to take place and ensure customer data 
integrity. .Completion of boot means the controller modules 10A and 10B are ready to communicate with a 

20 host computer. 

Initialisation begins with the processor 28. The processor 28 begins booting by executing an internal 
self-test, and upon successful completion, reading an "Initial Boot Record", which contains the address of 
the hardware setup parameters and process control information. Following the initialisation of processor 28, 
software in the memory card is executed to perform. 

25 Minimum Integrity Self-Test Diagnostics . Enough tests are performed to ensure that the processor 28 bus 
hardware is good, the rest of the memory card is good, and that shared memory is good. This test must 
complete successfully in order for the module operating system ("EXEC") as will be more fully described 
hereafter to start up and for controller module 10 initialization to continue. Note that the shared memory 
must have the first megabyte working in order for the shared memory to be deemed good (the rest of the 

30 memory can have up to 16 faulty areas before being deemed bad). After the processor 28 has booted, 
controller module 10 "Self-Test" diagnostics are executed to verify: 

1. DUART 48 and timer 50 chips. 

2. Memory code card 40 image. 

3. DRAB 38 gate array. 

35 4. Shared DRAM buffer 30 memory. 

5. All processor 28 to shared memory bus access types. 

The 32KB SRAM with battery backup, called the NV journal RAM 46, is only tested after functional 
code has initialized because of several reasons. First it contains either default or user-enabled parameters 
for controller module 10 operation. Second, it may contain write journal information for RAID operations. 

40 This means a diagnostic for the write journal RAM 46 may only be run when the memory is "free", meaning 
no write journal information within the memory, and the parameter information shadowed in the DRAM 
buffer 30 memory behind the DRAB 38. When this situation occurs, the SRAM test may be called. A failure 
in this test will cause the controller module 10 to software reset. 

These tests are required to be run before the controller module 10 software image is copied from the 

45 memory code card 40 to shared DRAM buffer 30 memory. When these diagnostics have successfully 
completed, the controller module 10 software image is copied from the memory code card 40 to the shared 
DRAM buffer 30 memory (first megabyte of the memory). The image is then verified by controller module 
10 diagnostics and control is passed to the controller module 10 executive. The integrity of the DRAM code 
image is covered by the ECC that is incorporated in the DRAB 38 chip, thereby not requiring periodic 

50 checks of the DRAM code image. 

EXECutive Initialization . The EXEC initializes next, followed by a call to the Diagnostic Execution Monitor 
that completes the controller module 10 subsystem diagnostics. This is done to allow the controller module 
10 diagnostics to use the services provided by the controller module 10 executive to handle interrupts and 
errors. 

55 calls the remaining tests for a controller module 10 (including tests for the optional cache module 110, if 

installed). The tests that will/may call, depending on what is present on the module or backplane are: 

1 . VD Cache 44 for the processor 28. 

2. RAID FX chip 54 test. 

9 
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3. Device port processor 32 test(s). One to six port processors 32 are tested from the processor 28 and 
via diagnostic scripts. These tests are immune to all port activity, including bus resets. 

4. Host interface 12 Port test. 

5 Cache Module 110 Test (Slave DRAB 114 and cache memory). This test is special because of the 
5 possible large size of memory on a cache module 110. This test will be run in two parts. The first part 
will test the DRAB's 114, the addressing hardware of the whole memory, and 1MB of memory. The 
second part will be executed as a thread, in conjunction with the controller module 10 Cache Manager, to 
test 1MB chunks of the cache module 110 and turn them over to functional code as they have been 
tested. This cuts back on boot time drastically, while not severely limiting controller module 10 
w performance while testing memory. 

When these diagnostics have completed successfully, control is returned to the controller module 10 
executive. The controller module 10 executive completes its initialization (as more fully described 
hereinafter) and starts controller module 10 software. 

If any error occurs during any portion of the self-test, the controller module 10 light emitting diodes 
,s ("LED's") will be set with the specific error code of the failure. An error report will also be printed on the 
controller module 10 console, if the Dianostic Execution Monitor fielded the error for a diagnostic. 

Dual-Controller module 10A and 10B Booting 

Special actions occur during a boot of a dual-controller module 10A and 10B configuration. When the 
controller module 10 DUART 48 diagnostic has completed, it may send a message to the other controller 
module 10B to alert it that it is booting. When diagnostics are complete, more extensive messages might 
then be sent between controller modules 10A and 10B. The first message sent by diagnostics would then 
basically be an indication that the controller module 10A is booting up so that the controller module 10B 
won't decide that the controller module 10A is disabled and physically disable the first to boot controller 
module 10A from trying to come up as will be more fully described hereinafter. 

The controller modules 10A and 1 0B in a dual-controller module 10A, 10B configuration may or may 
not boot synchronously, the likely case being that they will have their boots skewed in time. This 
asynchrony is by design as controller modules 10A, 10B can have different boot rates depending on the 
amount of good memory found during boot, operator intervention on one controller module 10 by 
"RESET"ing one controller module 10 and so forth. The mechanism just descr.bed to alert a controller 
module 10A, for example, that is ahead in the boot process that there is a second controller module 10B 
trying to boot as well, is part of the failover scheme for the controller modules 10. A controller module 10B 
for example, that is detected by a "live" controller module 10A to be "dead" or not responding properly w.ll 
be physically disabled by the "KILL" line between the two controller modules 10A and 10B as will be more 
fully described hereinafter. 

If a controller module 10A, for example, cannot boot properly due to failure in diagnostics or software 
initialisation, the second controller module 10B will have to disable controller module 10A by using the KILL 
line and proceed to control the storage that controller module 10A was serving to the host computer(s). 

Dual-Controller Module 10A and 10B Interface 

Controller modules 10A and 10B communicate with each other in three ways: 
1 "Other Controller Module 10 in place" signals between controller modules 10A and 10B. Communica- 
tion will not be attempted unless this signal is asserted in a controller module 10 Read Diagnostic 



25 



30 



35 



40 



45 



2 UART signals. There are four signals shared between the controller modules 10A and 10B, the UART 
transmit and receive lines, along with Clear to Send lines from each controller module 10A and 10B. This 
is the main channel for communication between controller modules 10A and 10B and is used by 

so diagnostics and functional code to carry out this communication. 

3 KILL line This line is part of a circuit in the DRAB 38 gate array that allows the controller module 10A, 
for example when it receives an indication that the controller module 1 0B in a dual-controller module 10 
configuration is broken, to disable controller module 10B and take over its cache module 110B and 
storage devices it was previously serving to the host computer(s). This line is "failsafe" in that once a 

55 controller module 10A or 10B asserts this line, it prevents itself from being "killed" by the other controller 
module 10B or 10A respectively. In the event that both controller modules 10A and 10B think the other 
controller module 10B and 10A is broken (for example a bad UART connection), the first controller 
module 10 to set the KILL line survives and takes over control of all devices. 
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This interface is used by controller module 10 Failover software as well as controller module 10 
Configuration and controller module 10 Warmswap software. Controller module 10 Failover uses this 
interface to keep the other controller module 10 aware that it is alive and to take over the other controller 
module 10's job if it determines that the other controller module 10 is broken. Controller module 10 

5 Configuration software uses this interface to convey information about storage attached to it that is serving 
one or more host computer(s). This information is necessary in the event of failover, so a controller module 
10 can accurately perform the failover maneuver. Controller module 10 Warmswap software uses this 
interface to convey the current state of the DECstor/me enclosure (available from Digital Equipment 
Corporation) of LED's as well as the OCP Bus Quiesce Request buttons on each controller module 10. One 

10 controller module 10B, for example, will tell the other controller module 10A what state to put the buttons 
and LED's into. 

Cache Module 110 Initialization 

75 The cache modules 110 are initialized in one of two methods. The choice of method depends on 

whether the cache module 110 is a "read" cache or "write-back" cache and, if a write-back cache, if there 
is valid data in the cache module 110. 

If the cache module 110 is a read cache, then after controller module 10 diagnostics are completed, 
both the interface and memory of the read cache are tested by diagnostic software. 

20 If the cache module 110 is a write-back cache, then the controller module 10 software checks for valid 

data in the cache before diagnostics test the cache module 110. If there is valid data in the write-back 
cache, then non-destructive diagnostics will be run on the cache interface to the controller module 10 that 
"owns" the cache module 110. If there is not valid data in the write-back cache, then diagnostics will be run 
on the entire cache module 110. the same as done for a read cache module 110. 

25 

DECstor/me Subsystem Initialization with Controller Modules 10 

DECstor/me subsystem initialisation takes place at power-up with subsets of initialization activities 
occurring during normal operation of the subsystems due to failures, partial power outages, and errors. 
30 The total sequence for initialization is: 

1. Power ON. This causes all shelves to be reset, and devies contained in the shelves (drives, controller 
modules 10, caches modules 110 and so forth) to being caused to INIT. 

2. Entity self-test (if present). 

3. The controller modules 10 begin interrogating those devices it knows about (including the other 
35 controller module 10, if present). These INIT activities may include spinning up drives, in which case the 

controller module 10 may blink the light on the drive enclosure that the drive is spinning up within (for 
spin-down, the same blinking occurs as well). 

4. Controller module 10 begins data transfers on behalf of the host computers. DECstor/me is 
operational. 

40 

Controller Module 10 Lamps and Buttons 

The controller module 10 has an Operator Control Panel (OCP) module attached to the controller 
module 10 that is comprised of 7 buttons with embedded LED's in the buttons. The OCP is on the edge of 

45 the controller module 10 that faces out toward the door of the cabinet for easy access. As an example, the 
left-most LED may be green and the right-most six may be amber. The controller module 10 can light each 
LED separately via Write- Diagnostic Register (0) 84. The buttons can be read via Read-Diagnostic Register 
(0) 80. The left-most button is always designated as the controller module 10 RESET button. The usage of 
the LED's and the remaining six buttons varies depending on the state of the controller module 10, whether 

50 it is booting or in normal operation. 

During controller module 10 boot, the LED's are for diagnostic use in reporting errors and controller 
module 10 state. At the moment the controller module 10 boot begins, hardware turns on the left-most LED 
that is in the RESET button, with the remaining six LED's turned off. This denotes the 'boot* state. If at any 
point in the boot sequence a fault is found by the controller module 10 diagnostics, the right-most six LED's 

55 are written by the controller module 10 diagnostics with the error code denoting the fault that has been 
detected. This fault code will be flashed at 3Hz, with the left-most LED remaining on and not flashed. 

Following controller module 10 boot, the left-most LED is flashed at 1Hz. This denotes the "running" 
state of the controller module 10 and is the "heartbeat" of the machine. At this point the usage of the LED's 

11 
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and the right-most six buttons has changed from diagnostics to functional usage. At this time these six 
buttons become the OCP "Bus Quiesce Request" buttons for use in device warmswap or device warm 
insertion/removal. 

If at any time, a controller module 10 fails following boot, either during diagnostics or while running 
5 functional code, the controller module 10 will exhibit the following behavior in the LED's: 

1 . If the failure was in diagnostics, ALL LED's will be lit with no blinking. 

2. If the failure was in Functional code (due to a BUGCHECK loop), all but the right-most LED/button will 
be lit with no blinking. 

70 Device Warmswap 

The Modular Array Storage Subsystem affords the ability to warmswap attached storage devices. The 
methods for controller modules 10 differ based on the type of Host Interconnect on the controller module 10 
being swapped. There is a single method for attached storage warmswap. Controller module 10 attached 
75 device warmswap is used in conjunction with the right button(s) and LED's on the controller module 10. 

Controller Module 10 Attached Storage Warmswap 

Controller module 10 device warmswap is really a sequence of steps that allows an operator to safely 
20 remove a device and then later install another device. These steps are broken up into removal and insertion 
steps. This means that "safe" removal of a drive or addition of a drive is also possible for situations other 
than drive failure (and associated warmswap). The reasons for the steps below are for data integrity for 
other controller module 10 devices, and to reduce the chance of an operator causing a port to be unusable 
for a long period of time (which could disable several RAID sets). Only one port may be quiesced at any 
25 one time. 

In order to remove a drive, either controller module 10 or an operator must determine that the operation 
is necessary. The controller module 10 can determine that a device is bad by trying to use it and finding no 
response or too many errors from the device. An operator may do what he wants to determine a device 
must be removed. In either case, a sequence must be carried out in order to properly remove a device. 
30 The sequence for removal of a device is: 

1. The controller module determines a drive is bad or an operator uses the console to declare a drive 
bad. This causes the LED associated with that drivel port and drive enclosure to be illuminated by the 
controller module 10 software. This LED is embedded in one of the OCP "Bus Quiesce Request" 
buttons. 

35 2. An operator pushes the illuminated OCP "Bus Quiesce Request" button on the front of the controller 
module 10. This button must be held until such time as the controller module 10 software recognizes the 
button. The operator can tell that the controller module 10 has recognized the button by the fact that the 
remaining amber LED's will flash momentarily. If the LED in the button was already on, it stays on. If the 
LED in the button was off, it is now on. If the button is not held in long enough, or multiple buttons are 

40 pushed in quick succession, they will be ignored. If a button has been recognized by the controller 
module 10 software and a second button is pressed, the second button push will be ignored. 

The OCP "Bus Quiesce Request" signals to the controller module 10 Warmswap software to 
proceed with quiescing all activity on this port. The time taken to quiesce a port is given by the following 
equation: 

45 

Quiesce__Time = Operator__Delay_Factor + 
Time to run down IO 

Operator Delay Factor is currently set at 30 seconds. 

so The length of time required may as noted so that the actual time a SCSI bus is quiesced is kept to a 

minimum for RAID operations. The Time_to_run_down IO is dependent on port I/O load and can 

vary from almost zero to minutes. 

3. When the port is quiesced, both the port OCP "Bus Quiesce Request" button light and the 
DECstor/me drive enclosure light will be flashed, informing the operator that the device may be removed 

55 safely. At this point it is very important to note that ALL activity on the port has stopped. 

4. The operator removes the device. The physical action of removing the drive causes a circuit in the 
DECstor/me drive enclosure to send an interrupt to the controller module 10, alerting the controller 
module 10 that the bus may safely be used again, thereby minimizing operator involvement in the 
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warmswap process. The controller module 10 receives an interrupt (SP LO INT L) when this occurs, 

so that the controller module 10 can resume usage of the bus immediately. This feature reduces 
operator intervention to cut the time that the port is unusable. 
In order to insert a device, a similar procedure is employed. Steps 2-4 are used to insert the device 
with a minimum of operator intervention and port down-time. 

Warmswap is then the combination of a device removal followed sometimes later by a device insertion. 
If a situation occurs wherein an insertion or removal sequence is started and a "Bus Quiesce Request" 
has been initiated, an operator can cancel this by depressing the button a second time, waiting for the 
LED's to momentarily flash and then checking to see that the LED in the button has gone out. 

Controller Module 10 Swap 



This feature will involve a shutdown of the controller module 10 or cache module 110 to be swapped 
out prior to actual removal. This shutdown will involve software, unless the controller module 10 to be 
75 removed is completely non-functional. If the controller module 10 to be replaced is in a dual redundant 
configuration, the surviving controller module 10 may be shutdown temporarily when the controller module 
10 to be replaced is actually to be removed or a warmswap feature can be provided. 



20 



Diagnostic Registers 

The bits in Write Diagnostic Register (0) 84 are illustrated in Fig. 6a and further described in the 
following table. 



Table 2 



25 



Write Diagnostic Register (0) 84 


Bits 


Name 


Description 


0-15 


Reserved . 




16 


RESET CIRT 


When written with a 1, this bit asserts the reset input to the CIRT 






chips in the CI host port. This bit is initialized to 1. 


17 


MODULE RESET 7 


When written with a 0, this bit forces a reset to the module 






hardware, thus causing a re-boot of controller module 10. This bit 






initializes to a 1 . 


18-23 


SCSI[1:6]_ SNOOP_CLEAR 


When written with a 1 , these bits clear the corresponding interrupt 






request from the SCSI device ports to the processor 28. At the 






same time, they also clear the corresponding S[1 :6] SWAP bits in 






Read Diagnostic Register (1) 82. These bits are initialized to 1. 


24-29 


-LED [0:5] 


When written with a 0, these bits turn on the corresponding LED's 






in the OCP Bus Quiesce Request buttons, visible on the front of 






the controller module 10. These bits are initialized to 1 . 


30-31 


Reserved 





30 



35 



40 



45 The bits in Write Diagnostic Register (1) 86 are illustrated in Fig. 6b and further described in the 
following table. 



50 



55 
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Table 3: Write Diagnostic Register (1) 8 6 
Bits Name Description 

0 LOCK PRIMARY 

CACHE When written with a 1, this bit asserts the 

lock signal to the optional cache module 
110 normally associated with this 
controller module 10. The controller 
module 10 then has exclusive access to its 
cache module 110 and the alternate 
controller module 10 cannot access this 
cache module 110. This bit is initialized 
to 0. 



When written with a 1, this bit asserts the 
lock signal to the alternate optional cache 
module 110 not normally associated with 
this controller module 10. This controller 
module 10 then has exclusive access to the 
alternate cache module 110. It is assumed 
that the alternate controller module 10 has 
also been disabled via the KILL signal. 
This bit is initialized to 0. 

2 -TAG RESET When written with a 0, this bit asserts the 

~ reset input to the processor 28 I/D cache 

44 tag RAMs 70. This bit is initialized to 
0. 

25 

3-7 Reserved 

8 SCSI FAULT CLK This bit is used to clock data from bits 

~ — 10-15 of this register to the corresponding 

SCSI device port. The data in bits 10-15 
30 must be set up prior to writing a 1 to this 

bit. Data in bits 10-15 may be changed 
when a 0 is written to this bit . One bit 
is transmitted to the corresponding SCSI 
device port for each 1-0 combination 
written to this bit. This bit initializes 
to 0. 

35 

9 Fault Enable H When written with a 1, this bit enables the 

~ ~" fault clock to go to the SCSI flex strips 

in the drive enclosures for a specified 
port to set/clear drive enclosure LED ' s . 



1 LOCK_SECONDARY 
CACHE 

75 



20 



40 



45 



50 



55 
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10-15 S[1:6]_FAULT 
5 DATA 



10 



75 



25 



30 



35 



40 



45 



16 TIMER GATE 



17 -COMM RESET 



20 18 -Button Clear H 



19 -LED [6] 



20-23 -IPAR_ODD [0:3] 



24 



25 



-FX RESET 



-HOST_PORT 
RESET 



26-31 -SCSI [1:6] RESET 



This bit is initialized to 0, thereby 
disabling the clock. 



These bits, used in conjunction with bit 8 
of this register, drive the fault data 
lines in the corresponding SCSI device 
ports. These bits are used to generate a 
serial bit stream that is detected and 
decoded by individual devices to control 
LED's on the front of each device in a 
STORME shelf. These bits are initialized 
to 0. 

When written with a 1, this bit enables the 
timer 50 interrupt to the processor 28 . 
This bit is initialize to 0. 

When written with a 0, this bit resets the 
DUART 48, preventing communication to the 
console terminal or between controller 
modules 10. This bit is initialized to 0. 

When written with a 0, this bit enables Bus 
Quiesce Request and SWAP interrupts to 
occur and be seen. This bit initializes to 
a 1, causing a clear to any button or swap 
signal, so that interrupts will not occur 
and the signals will not be seen. 

When written with a 0, this bit causes the 
left-most LED in the OCP (Operator Control 
Panel) module to be illuminated. When 
written with a 1, the LED is extinguished. 
This bit initialized to a "0", causing the 
LED to be illuminated, denoting "Controller 
Module 10 Booting." 

When written with a 1, these bits force 
even parity on the corresponding bytes 0-3 
on the IBUS 19, in order to force parity 
errors and check parity logic by controller 
module 10 self-test software. These bits 
are initialized to 0 . 

When written with a 0, this bit asserts the 
FX RAID chip 54 reset input . It is 
initialized to 0. 



When written with a 0, this bit asserts the 
reset to the Host Port logic. It is 
initialized to 0. 

When written with 0, these bits assert the 
corresponding reset input to port processor 
32 SCSI controller chip. These bits are 
initialized to 0. 



50 The read diagnostic registers 80,82 contain the state of signals observed from the module hardware. 
Some bits are specific error bits, while others are status bits, either from the DECstor/me cabinet, the other 
controller module 10, or the cache modules. The rest of the bits are the individual interrupt bits that are 
ORed together to form combined interrupts to the processor 28. 

The status bits are used by functional software in order to control warmswap and failover between 
55 controller modules 10 and cache modules. The interrupt bits are used by functional software to find out 
exactly what interrupt occurred during normal operation. 

The bits in Read Diagnostic Register (0) 80 are illustrated in Fig. 7a and further described in the 
following table. 

15 
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Table 4: Read Diagnostic Register (0) 80 



Bits Name 



Description 



0 SECONDARY^ 

CACHE LOCKED 



70 



This bit indicates the state of the alternate 
controller module's 10 LOCK__PRIMARY_CACHE bit. 
When 1, it indicates that the other controller 
module 10 has locked its own primary optional 
cache module 110 (normal condition) . When this is 
true, this controller module 10 may not write a 1 
into its LOCK_SECONDARY_CACHE bit. When 0, this 
bit indicates that the other controller module 10 
does not have its own primary cache module 110 
locked. 



75 1 



20 



PRIMARY^ 
CACHE LOCKED 



25 



30 



2-7 -SCSI[1:6] IRQ 



8-9 S[l:2] SKELF_OK 



This bit indicates the state of the alternate 
controller module's 10 LOCK_SECONDARY_CACHE bit. 
When 1, it indicates that the other controller 
module 10 has its secondary optional cache module 
110 (ie, this controller module's 10 primary 
cache) locked. This indicates that the other 
controller module 10 has detected an abnormal 
condition within the subsystem. When 0, this bit 
indicates that the other controller module 10 does 
not have this controller module's 10 primary cache 
locked (normal condition) . 

When 0, these bits indicate which port processor 
32 interrupted the processor 28. These bits are 
OR'd together to form the SCSI Port interrupt to 
the processor 28. 

When 1, these bits indicate that the power and 
cooling for the specified shelves supporting SCSI 
ports 1 and 2 are within satisfactory limits. 



35 



40 



45 



50 



55 
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10-15 -SCSI_P0RT_ 
SNOOP_ 

INTERRUPT [1:6] 



16-17 S[3:4] SHELF OK 



18-23 BUSJ2UIESCE 
10 REQUEST ( 1 : 6T 



75 



20 



24-25 S[5:6] SHELF OK 



26 -BATTERY FAIL 



27 MANUAL INIT 



25 28 



29 



30 



CODE_STORE_ 
WRITE PROTECT 



-CODE_STORE_ 
CARD PRESENT 



When 0, these bits indicate which port processor 
32 asserted the low priority SCSI Port interrupt 
to the processor 28 . 

When 1, these bits indicate that the power and 
cooling for the specified shelves supporting SCSI 
ports 3 and 4 are within satisfactory limits. 



When 1, these bits indicate that the corresponding 
OCP Bus Quiesce Request button on the front of the 
module has been pushed- No debouncing is provided 
in hardware. 

When 1, these bits indicate that the power and 
cooling for the specified shelves supporting SCSI 
ports 5 and 6 are within satisfactory limits. 

When 0, this bit indicates that the battery 
associated with the write journal RAM 4 6 has 
failed. 

When 1, this bit identifies to software in the 
controller module 10 that a reset that just 
occurred to the module was caused by manual 
intervention by an operator pressing the "RESET" 
button on the front of the controller module 10. 



When 1, this bit indicates that the code card 4 0 
is write protected (this is for SRAM code cards 
40) . 



When 0, this bit indicates that the code card 4 0 
is inserted into the controller module 10. 



35 



30 -CODE_STORE 
ALARM 



31 POWER OK 



40 



When 0 f this bit indicates that the battery inside 
the code card 40 (SRAM only) is failing. 

When l f this bit from the dc power supply 
indicates to the controller module 10 software 
that power is ok. for the module. In the event of 
an NMI interrupt to the processor 28, if this bit 
is 0, it indicates a power fail has occurred; if 
this bit is 1, it indicates that a DRAB 38- 
detected serious error has occurred (such as 
double bit ECC, parity, etc.) 



45 The bits in Read Diagnostic Register (1) 82 are illustrated in Fig. 7b and further described in the 
following table. 



50 



55 
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Table 5: Read Diagnostic Register (1) 82 



Bits Name 
0 SLOT 



70 



Description 

This bit is input from the backplane and is 
readable anytime after INIT of the controller 
module 10. This bit indicates to a controller 
module 10 whether it is controller module 10A or 
controller module 10B. It is also used to 
determine which SCSI ID the controller module 10 
should assume for its device ports. Controller 
module 10A and SCSI ID 6 are assumed if the SLOT 
bit is 0. Controller module 10B and SCSI ID 7 are 
assumed if the SLOT bit is 1. 



75 1 



20 



25 



-SECONDARY 
CACHE IN 



-PRIMARY 
CACHE IN 



-OTHER CNTRLR IN 



4-5 Reserved 

6 -CACHE_EVENT 
30 INTERRUPT B 



When 0, this bit informs a controller module 10 
that the optional cache module 110 normally 
associated with the other controller module 10 in 
a dual-controller, dual-cache configuration is m 
place in the backplane. 

When 0, this bit informs a controller module 10 
that its primary optional cache module 110 is m 
place in the backplane. 

When 0, this bit indicates that the other 
controller module 10 is in place in the backplane 
in a dual-controller configuration. 



When 0, this bit indicates that cache module 110 B 
is asserting its Event Interrupt signal. 



35 8 



-CACHE_EVENT_ 
INTERRUPT_A 

-TIMER_ 
INTERRUPT 



When 0, this bit indicates that cache module 110 A 
is asserting its Event Interrupt signal. 

When 1, this bit indicates that the timer 50 
interrupt is asserted. 



40 



9 -COMM_ 

INTERRUPT 



10-15 -S[l: 6] SWAP 



45 



When 0, this bit indicates that the DUART 4 8 
interrupt is asserted. 

When 0, these bits indicate that a device has been 
inserted into or removed from the corresponding 
SCSI device port OR that the corresponding Bus 
Quiesce [1:6} button has been depressed. These 
two signals are logically "OR" ? d. These bits are 
reset to 1 by writing a 1 in the corresponding 
SCSI [1 : 6]_SNOOP_CLEAR bit in Write Diagnostic 
Register (1) 86. 



16-31 Reserved 



50 



55 



Controller module 10 Interrupts and Relative Priority 

Controller module 10 interrupts are direct inputs to the processor 28. Since there are more interrupt 
sources than interrupt inputs, several interrupt sources are ORed together to reduce the total number of 
inputs. As a result, in some cases, software is required to read diagnostic registers 80-86 to determine 
exactly which interrupt actually occurred. 

These interrupts have the following priority (from highest to lowest down the list): 
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Table 6: Processor 2 8 Interrupts 



Processor 
Interrupt Name 



Description 



NMI 



w 



15 



SYSERR The SYSERR interrupt is the logical OR of the DRAB 

38 Error Interrupt (serious, not event) , the two 
possible Cache Error interrupt lines, and 
POWER OK. The first three are for serious module 
errors, when the processor 28 is accessing one of 
the memories. The last is for power fail. 

CLOCK The clock interrupt is for the Controller Module 

10 Executive to keep track of elapsed time. This 
is not the same clock for Time of Year (TOY) that 
may be on the NBUS 21. 

HOST PORT This interrupt is for the Controller Module 10 
Host Port in the event of a serious error. 



20 



25 3 



30 



HOST_PORT_ 
LOW PRIORITY 



FX 



SCSI PORT 



SCS I_PORT_LOW 
PRIORITY 



35 



This interrupt is for the Controller Module 10 
Host Port for normal processing support from the 
processor 28 . 

This interrupt is for the FX RAID chip 54 for 
error detection. 

This interrupt is the logical OR of the six SCSI 
port processor 32 error interrupts. This 
interrupt is asserted when the SCSI processor 
halts on a significant event. Which SCSI device 
port actually asserted the interrupt line is found 
in Read Diagnostic Register (0) 80. 



This interrupt is the logical OR of the six SCSI 
port processor 32 outputs used for normal 
processing support from the processor 28 and the 
six inputs from the SCSI device ports that 
indicate a device has been inserted or removed. 



COMM/ TIMER 



40 

INT This interrupt is the logical OR of the interrupts 

generated by the DUART 48 and the timer 50. 

0 DRAB_I NTERRUP T This interrupt is the logical OR of the DRAB 38 

event interrupt and the two optional cache event 
45 interrupts . This is for non-fatal errors such as 

single-bit errors that have been corrected or 
battery status for write-back cache module 110s. 



50 With reference additionally now to Fig. 4, the logic block functionality of the DRAB 38 for use in 
controller module 10 is shown. DRAB 38, which is coupled to MBUS 24 comprises, in pertinent part, a 
DRAM control block 130 and an associated ECC block 132 and address latch 134. DRAM control block 130 
provides a number of control line 96 outputs to the DRAM buffer 30 illustrated in Fig. 1. Control lines 96 
comprise row address strobe ("RAS") line 158, column address strobe ("CAS") 160 write enable ("WE") 

55 line 162 and output enable ("OC") lines. In addition, DRAM control block 130 provides a row/col line 136 
input to address latch 134. ECC block 132 is coupled to DRAM control block 130 and provides and ECC 
output on ECC lines 92 to DRAM buffer 30. Address latch 134 provides an output on DADDR lines 94. 
Inputs to ECC block 132 and address latch 134 are taken from MBUS 24 on address/parity 
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("ADDR PARITY") line 138. ECC block 132 receives an additional input on syndrome line 140 from 

decoder 142 coupled to MBUS 24 by means of data correction register 144. Decoder 142 also provides an 
output labeled memory error ("MEM ERROR") on line 156 for input to CSR register/diagnostic para- 
register 146 which is also coupled to MBUS 24. 

5 Address latch and decode logic 148 is coupled to ADDR PARITY line 138 and provides outputs to 

CSR register/diagnostic error register 146, l/D cache control 150 and bus control logic 152. I/D cache 
control 150 receives as inputs the signals on match A line 72 and match B line 74 as well as a bank input 

signal on bank line 170. I/D cache control provides outputs on C OE line 172, C WE line 174, LRU WRT 

line 176 and CACHE WRT line 178. 

w Bus control logic 152 is bidirectionally coupled to control bus 88 here illustrated separately as IBUS 16 
control ("CTL") bus 104 and CDAL CTL bus 88. Further, bus control logic 152 is bidirectionally coupled to 
NBUS 21 control lines 90 as shown. Additionally, DRAB 38 comprises an arbitration logic providing outputs 
on bus grant ("BG") line 164 and NBUS 21 bus master arbitration winner ("ARBWIN") line 168. Arbitration 
logic 154 receives as input a signal on bus requests ("BR")/cache select line 166. 

15 

DRAM BUFFER 30 CONTROL AND ECC LOGIC 

The DRAB 38 chip's DRAM buffer 30 controller is programmable. High performance burst read 
operations of 4 and 8 LONGWORDS in length are possible. The read burst length is programmable for each 
20 NBUS 21 requester. Port processor 32 and port processor 34 SCSI control chips will work with 4 or 8 
LONGWORD bursts. The host interface 12 will only generate single LONGWORD reads or 4 LONGWORD 
burst accesses to buffer memory. The FX chip 54 XOR engine will generate single LONGWORD, and either 
4 or 8 LONGWORD burst accesses. 

Burst accesses generated by the processor 28 port can terminate on any longword. The BLAST signal 
25 indicates that the current longword transfer is the last for the bus access. 

The internal ECC detection and correction can be turned off. By writing a one to the CSR:ECCI bit, ECC 
error checking and correction is inhibited. Clearing this bit enables both error detection and single bit error 
correction. 

The DRAB 38 ECC code includes ROW, COLUMN, and HI address bits to allow some detection of 
30 addressing errors in the DRAM buffer 30 array. The COLUMN bit in the ECC code is the even parity bit of 
address bits <11:2>. The ROW bit is the even parity of address bits <21:12>, and the HI bit is the even parity 
bit of address bits (25:22). 

The DRAB 38 ECC code is shown in the following table. 

Table 7: DRAB 38 ECC CODE 



ECC MATRIX 
cod« d«pth #7; 
cod«_w±dth #4 3; 



45 





HRC 
IOO 
WL 


ccc 

000 
654 


cccc 
oooo 

3210 


DDDD 
3322 
1096 


DDDD 
2222 
7654 


DDDD 
2222 
3210 


DDDD 
1111 
9876 


DDDD 
1111 
5432 


DDDD 
11 OO 
1098 


DDDD 
0000 
7654 


DDDD 
OOOO 
3210 


CHECK 
BIT 0: 


Oil 


OOO 


0001 


0100 


0100 


1100 


0111 


0011 


1011 


1011 


1000 


CHECK 
BIT 1: 


101 


000 


0010 


1010 


0010 


0010 


1011 


1101 


0101 


1101 


O100 


CHECK 
BIT 2: 


in 


000 


O10O 


0001 


1001 


0001 


1100 


1110 


1110 


0110 


0011 


CHECK 
BIT 3: 


000 


000 


1000 


1000 


1000 


1000 


1111 


1000 


1000 


1000 


1111 


CHECK 
BIT 4: 


111 


001 


oooo 


1101 


1011 


1110 


0110 


0001 


O01O 


0100 


1001 


CHECK 
BIT 5: 


110 


010 


0000 


mo 


1101 


1011 


1001 


0100 


O0O1 


0010 


0110 


CHECK 
BIT 6: 


111 


100 


oooo 


1011 


1110 


1101 


1111 


0010 


0100 


0001 


oooo 



55 



20 

BNSDOCID: <EP 0632379A2_t„> 



EP 0 632 379 A2 



REGISTER DEFINITIONS 
REGISTER ADDRESS MAP 

5 All internal registers are accessed through the MBUS 24 port. The base address of the DRAB 38 internal 
registers is determined by a chip select signal provided by the module. 

DRAB 38 setup register 

10 Table 8: DRAB 38 setup Register 



DRAB base + 

4 4 4 4 - 

+ 

00 | Setup |BT 

+ _ + 4 — 

31 14 23 IB 15 08 07 00 



20 



25 



30 



35 



40 



45 



50 



55 
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DRAB 38 CSR register 

Table 9: DRAB 38 Control and Status Register 



DRAB bale + , 

4 + + 

+ m 

M | Control and Status P 1 

\ * 



10 _ + 

31 24 23 IB 15 0,107 °° 

Table 10: DIAG diagnostic control Register 

75 

DRAB but + . 



DIAG control 



31 2*23 

Table 12: Error Address Register 



Table 13: Error Data Register 

35 - 



04 1 



Error D*t» 



40 

Table 14: Error Region Register 



Table 15: Region Setup Register 

50 



RW 



04 I _ , 

31 — ^ 23 i 6 i5 08 07 00 

20 Table 11: DRAM Buffer 30 Error Register 



DRAB but + | f 

"* ' ~~ RW 

I Dram Error 

25 ^23 16 15 OB 07 00 



30 DRAB + j ( , ► 

04 , Error Addw RW 

3l ~~ 24 23 16 13 0« 07 00 



DRAB b*jc 4 



31 24 23 16 13 0807 00 



DRAB but * A 

45 04 I Error Regioo* RW 

3J 24 23 1^15 0«O7 00 



DRAB buc + m ¥ 

04 | Region Setup RW 

31 24 23 16 15 08 07 00 
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Table 16: I/D CACHE 44 HIT COUNTer 



DRAB \ 



04 I 



I/D CACHE HIT C0UNroi:0> 



31 24 23 16 15 08 07 

Table 17: I/D CACHE 44 MISS COUNTer 



10 



VD CACHE MISS COUNTOl :Q> 



RW 



00 



75 

REGISTER DESCRIPTIONS 
20 DRAB 38 SETUP REGISTER (SETUP) (R/W) 

This register must be written with a LONGWORD operation on a LONGWORD boundary. 



25 



Table 18: DRAB 38 SETUP Register bit definitions 



30 



35 



DRAB baa* + 

3322222222221111111111 

1O987654321O9876543210987654321O 
Power up dofault : 

|0|C|0|7|O|0|3JC| 
|0 0 0 0|1 1 O 0|0 0 O 0(0 11 1|0 0 0 0|0 0 0 0|0 0 1 111 1 O 0| 
I I I I I I I I I 

•*--+-+-+--+-+-+-+-+- +-+-+-+-+-+-+-+-+-+-+-+-+-+ 
00 | | I III RErR INTERVAL | |xf | | I I I I I I NBTO | | | 



♦—+-+-+-+-+-+-+-+-4— +-+-+-+-+-+-4 

\ / \_/ 

I I 

| RETR_TYPE<1:0> 
* — NBTJRST 
— B AN1CCONT I G 
— KILL ENABLE 



40 



-+-+-+-+-+-+-+-+-+-+-+-+-+-+ 
I II 
I I I 

| | * -MEM_SIZE 

| * — DISABLE ECC 

■— DIS_APAR_CHK ~ 

PROCESSOR 28 HI PRI 

IPAR DISABLE 

TEST MODE ENABLE 

CACHE I ONLY ( I ONLY) 

CACHE D ONLY (DONXY) 

*— HOST PORT HI PRIORITY (HPHI) 

REFRESH COUNTY PRESET (RCP) 

x 

HOST PORT PRIORITY ENABLE (HPPE) 



SETUP MEM_SIZE - Bit: <0> Type: RW 

45 

Selects the size of the memory, and is init to 0. 

0 - 1 Meg x 4, 

1 - 4Meg x 4. 

so SETUP DISABLE ECC - Bit: <1> Type: RW 

ECC checking is disabled when this bit is set. Correct ECC is still generated and written to memory. It 
is initialized to 0. 



55 SETUP NBTO<3:0> - Bits: <5:2> Type: RW 

These bits determine the timeout value for a processor 28 to port processor 32 bus transaction. The 
timeout will occur when the TA or TEA signal is not asserted within the SETUP:NBTO time. The value 
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15 



20 



determines the number of l60nS intervals from Transaction Start to expiration. It is initialized to OF (1111). 
SETUP DIS_APAR__CHK - Bit: <6> Type: RW 

5 Address parity checking, i.e., address parity error detection, on memory accesses is disabled when this 

bit is set. It is initialized to 0. 

SETUP PROCESSOR 28_HI__PRI - Bit: <7> Type: RW 

io Setting this bit to a ONE places the processor 28 memory requests above all others except host port 

When set to a ZERO, the priority of processor 28 is equal to all NBUS 21 requesters. This bit is initialized 
to ZERO. 

SETUP IPAR DISABLE - Bit: <8> Type: RW 

Setting this bit to a ONE causes the DRAB 38 to ignore IBUS 19 parity errors. This bit is initialized to 
ZERO. 

SETUP TEST MODE ENABLE - Bit: <9> Type: RW 

Setting this bit to a ONE places the DRAB 38 in test vector mode. This delays assertion of the NBUS 
21 control lines for 1/2 clock cycle to allow device test vectors to stimulate the NBUS 21 control logic. This 
bit is initialized to ZERO. 

25 SETUP CACHE I ONLY - Bit: <I0> Type: RW 

This bit sets the ID CACHE control logic to cache instruction fetch memory cycles only. This bit is 
initialized to ZERO. 

30 SETUP CACHE D ONLY - Bit: (11) Type: RW 

This bit sets the ID CACHE control logic to cache data fetch memory cycles only. This bit is initialised 
to ZERO If both CACHE D ONLY and CACHE I ONLY are both set or both reset (and the 

CACHE ENABLE bit in the DRAB 38 Diagnostic CSR register is set), the ID cache will cache both 

35 instruction and data memory cycles. 

SETUP HPHI - Bit: <I2> Type: RW 

The HOST PORT HIGH PRIORITY bit sets the arbitration priority for the host port. If this bit is set to a 
40 one, and HPPE is set, the host port will be the highest priority requester on the NBUS 21. This bit is 
initialized to ZERO. 

SETUP RCP - Bit: <I2> Type: W 

45 Writing one to the REFRESH COUNT PRESET initializes the refresh counter to known value. This is a 
read only bit. 

SETUP HPPE - Bit: <15> Type: RW 

The HOST PORT PRIORITY ENABLE bit enables the HPHI bit. If this bit is set to one, the HPHI bit sets 
the arbitration priority for the host port requester on the NBUS 21. If this bit is a zero, the HP ARB HI — L pin 
on DRAB 38 sets the host port priority. This bit is initialized to ZERO. 

SETUP REFR INTERVAL<7:0> - Bits: <23:16> Type: RW 

Fixes refresh rate to be equal to (640nS x REFR INTERVAL + 320nS). 

Typical values are as follows. 

For a 16mS refresh period for Imeg x 4 DRAM 98 or for a 64mS refresh period for 4Meg x 4 

24 
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DRAM 98: 

I refresh/1 5.6uS -> REFR-INTERVAL = 23 

5 SETUP REFR-TYPE<l:0> - Bits: <25:24> Type: RW 

These bit are initialized to Ras only refresh mode. They can be setup as defined below. 

00 - Ras only refresh mode, 

01 - Read scrubbing mode, 

io 10 - Refresh write - See NOTE, 

I I - Cas before Ras low power refresh mode. 

NOTE: The '10* bit pattern initializes the memory array. The pattern written is the last pattern read. For 
example, to initialize memory with AAAAAAAA#16, write AAAAAAAA#16 to address X, then read address X. 
Next set the REFR WRITE bit to initiate the memory pattern fill. 

15 

SETUP NBURST<2:0> - Bits: <28:26> Type: RW 

These bits set the longword burst rate on the NBUS 21. They are initialized for a 4 longword burst. The 
bits are equal to the desired number of longwords/burst - 1 . Examples follow: 
20 000 - 1 longword burst, 
011-4 longword burst, 
111-8 longword burst. 

SETUP BANK CONFIGd :0> - Bits: <30:29> Type: RW 

25 

Selects number of banks of memory, and is init to 0. 
00. 11 - 1 bank, 
01 - 2 banks, 
10-4 banks. 

30 

SETUP KILL ENABLE - Bit: <3l> Type: RW 

The KILL line is a tristate line interconnecting two controller modules 10. The KILL line is pulled up on 
both controller modules 10A and 10B. When this bit is set, and CSR:KILL is reset, the DRAB 38 chip is 
35 enabled to drive the KILL line. While enabled, the DRAB 38 ignores values on the KILL line (cannot be killed 
itself). 

Once this bit is set, and the CSR:KILL bit is reset, the KILL sequence is started. If the following 
processor 28 transaction is a write to the DRAB 38 CSR to set the KILL bit, the KILL line will be asserted, 
resetting the other controller module 10. The KILL line will be de-asserted when the KILL ENABLE bit is 
40 cleared. 

If KILL ENABLE is set, and the next processor 28 bus transaction is not a write to set the CSR:KILL bit, 
the KILL ENABLE bit is reset 

DRAB 38 CONTROL and STATUS REGISTER (CSR) (R/W) 

45 

This register must be written with a LONGWORD operation on a LONGWORD boundary. 
CSR WD-PERR<3:0> - Bits: <3:0> Type: RW 

50 On any transaction on the MBUS 24 where a write data parity error has occurred, the corresponding bit 
for each byte in error is set. The ES bit is also set and an interrupt to the processor 28 occurs. The error 
address, data and region are saved in the EAR, EDR and ERR registers, respectively. Write 1 to clear. 
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15 



20 



25 



Table 19: DRAB 38 CSR Register bit definitions 



DRAB bas« + 
3 3 2 2 
10 9 8 



2 2 
6 5 



04 



I 

+-+-+-+-■ 

I I lx|x 



2 2 2 
4 3 2 
! 

-+-+-+-+-+-+• 
x| | | I I I 
+-+-+-+-+-+-+ 



2 2 
1 0 



1111 
9 8 7 6 



I 



I I I I I 



+-+-+• 
I I i 
+-+-+-+ 



I I 

i i 

I 1 

I < I I 
I 
I 



I I t 



I 

I 
I 

I ! 
I I 



I I 
I I 
I 
I 
I 



I I I I 
I I I I 



I 
I 
I 
I 

I I 
! I 



I I 
I I 
I i 
1 I 
I 
I 
I 



I 
I 

I I 
I I 



ill) 
I I I I 
1111 
! I I I 

i i r- 

i i * — 



i i i 
i i i 



i 

i 

i i i 



t i i 

i i i 

i i i 

i i i 



i i 



i i i 
tiii 

i i t 

i i i 

i i i 

i i i 

i i i 

i i > 
i t 



i 
i 

i i i 
i t i 



i 
i 

t i 



8 7 
I 



6 5 



4 3 
1 



1 0 

I 



| | | | | |WD_PERR| 
-+-+-+-+-+-+-+-+-+-+ 
lilt 

ID_P£KR 

| | * CACHE_RW 

J * CACHE_BYT£_WRT 

CACHE_CYCIX 

. — BYTE_WRT 

£|f 

- — ARBWIN<0> 
AFJ3WIN<1> 



■ ARBWIN<2> 

IBUS 19_CYCLE 

ADDR_PERR 

NXM 



XN__BUSTO 
MBIT_ERR 
SBIT_CRRCTN 
MOLT SBC 



SCRB_CYC CMPLT 
NB TEA "~ 

PORT PROCESSOR 32_ARB_DIS 
IRESET 



CACHEA_NMI 
CACH£B_NMI 
CACHE TIMEOUT 



KILL 

ES 

CSR ID PERR - Bit: <4> Typ« : 



RW 



30 

On any transaction on the IBUS 19 where a data parity error has occurred, this bit is set. This covers 
data transactions for the following regions as programmed in the DRAB 38 REGION SETUP REGISTER: 
BUFFR, ITON, CACHEA, CACHEB, CAACSR. CABCSR 

This bit is the ORed output of the individual byte parity errors on the IBUS 19 parity generators and 
checkers. No information is available as to which byte the parity error was found. The ES bit is alsc > set and 
an interrupt to the processor 28 occurs. The error address, data and region are saved in the EAR, EDR and 
ERR registers, respectively. Write 1 to clear. 

CSR CACHE_RW - Bits: <5> Type: RW 

This bit is set when the ES bit is set on a transaction to the 'read cache 1 . The operator will know what 
type of 'read cache' transaction (read or write) was in process when any error which sets the ES bit 
occurred. This data will only be valid when the ES bit is set. Any value read when the ES bit is not set is 
not valid. The field is cleared when the error bit is written to clear. 

CSR CACHE BYTE WRT - Bits: <6> Type: RW 

This bit is set when the ES bit is set on a transaction to the "read cache'. The operator will know that a 
•read cache' byte write transaction was in process when any error which sets the ES bit occurred. This data 
so will only be valid when the ES bit is set. Any value read when the ES bit is not set is not valid. The field is 
cleared when the error bit is written to clear. 



35 



40 



45 
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CSR CACHE CYCLE - Bits: <7> Type: RW 

This bit is set when the ES bit is set on a transaction to the 'read cache'or 'wite-back cache'. The 
operator will know that a 'read cache' transaction was in process when any error which sets the ES bit 
occurred. This data will only be valid when the ES bit is set. Any value read when the ES bit is not set is 
not valid. The field is cleared when the error bit is written to clear. 
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CSR BYTE WRT - Bits: <8> Type: RW 

This bit is set when the ES bit is set on a buffer memory transaction. The operator will know that a 
buffer memory byte write transaction was in process when any error which sets the ES bit occurred. This 
5 data will only be valid when the ES bit is set. Any value read when the ES bit is not set is not valid. The 
field is cleared when the error bit is written to clear. 

CSR RW - Bits: <9> Type: RW 

w This bit is set when the ES bit is set on a transaction to the buffer memory. The operator will know what 

type of buffer memory transaction (read or write) was in process when any error which sets the ES bit 
occurred. This data will only be valid when the ES bit is set. Any value read when the ES bit is not set is 
not valid. The field is cleared when the error bit is written to clear. 

75 CSR ARBWIN<2:0> - Bits: <12:10> Type: RW 

These bits will hold the value of ARBWIN 168 when any error which sets the ES bit error occurs on an 

NBUS 21 transaction (IBUS 19 CYCLE = 0). The operator will then know who the bus master was. This 

data will only be valid when the ES = I and IBUS 19__CYCLE = 0. The field is cleared when the error bit is 
20 written to clear 

CSR IBUS 19_CYCLE - Bit: (13) Type: RW 

This bit is set when the ES bit is set on an IBUS 19 transaction. The operator will know that an IBUS 19 
25 transaction was in process when any error which sets the ES bit occurred. This data will only be valid when 
the ES bit is set. Any value read when the ES bit is not set is not valid. The field is cleared when the error 
bit is written to clear. 

CSR ADDR PERR - Bit: <I4> Type: RW 

30 

When an address parity error has occurred on a DRAM 98 access, this bit is set. The ES bit is also set 
and an interrupt to the processor 28 occurs. The error address, data and region are saved in the EAR, EDR 
and ERR registers, respectively. Write 1 to clear. 

35 CSR NXM - Bit: <15> Type: RW 

This bit is set when nonexistent memory has been addressed. The ES bit is also set and an interrupt to 
the processor 28 occurs. The error address, data and region are saved in the EAR, EDR and ERR registers, 
respectively. Write 1 to clear. 

40 

CSR IN-BUSTO - Bit: <I6> Type: RW 

This bit is set on a bus timeout from an IBUS 19 to NBUS 21 transaction. The ES bit is also set and an 
interrupt to the processor 28 occurs. Write 1 to clear. 

45 

CSR MBIT ERR - Bit: <17> Type: RW 

When a multiple bit error on data read from DRAM 98 has occurred, this bit is set. The ES bit is also 
set and an interrupt to the processor 28 occurs. The error address, data and region are saved in the EAR, 
so EDR and ERR registers, respectively. The DER register values are also captured. Write 1 to clear. 

CSR SBIT CRRCTN - Bit: (18) Type: RW 

When a single bit correction to data read from one of DRAM's 98 has occurred, this bit is set. It does 
55 NOT cause the ES bit to be set, but if the DCR:SBIT_CR_Jv1ASK bit is set, will cause an interrupt to the 
processor 28. The DER register values are captured. Write 1 to clear this bit. 
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CSR MULT_SBC - Bit: <19> Type: RW 

When a single bit correction to data read from one of DRAM's 98 has occurred more than once, this bit 
is set it does NOT cause the ES bit to be set and does NOT generate and interrupt. The error does not 
change the error information in the DER which was captured for the first single bit error correction. Write 1 
to clear this bit. 

CSR SCRB__CYC__CMPLT - Bit: <20> Type: RW 

When a scrubbing cycle of DRAM's 98 is complete, this bit is set. It does NOT cause the ES bit to be 
set. This is just update information that can be read at the operator's convenience. This bit should be 
checked to make sure refreshes to the DRAM 98 are continuing. Write 1 to clear the bit. 

CSR NB TEA - Bit: <21> Type: RW 

When the DRAB 38 chip sees a TEA on the NBUS 21 , this bit is set. It causes the ES bit to be set and 
an interrupt to the processor 28 occurs. Write 1 to clear. 

CSR PORT PROCESSOR_ARE_DlS - Bit: <22> Type: RW 

This bit is set when the ES bit is set from the errors described in the CSR. It will NOT be cleared when 
the initial error causing the ES bit to be set is cleared. It's function is to disable NBUS 21 transactions i until 
the processor 28 can process the interrupt and assess the situation. Write 1 to clear and enable NBUS 21 
transactions. 

CSR IRESET - Bit: <23> Type: W 

Writing a one to this bit will reset all the DRAB 38 internal state machines and internal error state 
registers. The internal reset lasts for 8 BCLK cycles. Writing a zero to this bit has no effect. 

CSR CACHEA NMI - Bit: <24> Type: RW 

When an unrecoverable error on the Cache A board is detected, this bit is asserted I in- *e JJaster 
CONTROLLER MODULE 10 DRAB 38. An NMI to the processor 28 occurs. The ES and PORT PROOtb- 
SOR ARB DIS bits are also set. Write 1 to clear the error on the Cache A DRAB 1 14 CSR first BEFORE 
writing 1 to clear this bit in the Master CONTROLLER MODULE 10 DRAB 38. 

CSR CACHEB NMI - Bit: <25> Type: RW 

When an unrecoverable error on the Cache B board is detected, this bit is asserted in^ the Master 
CONTROLLER MODULE 10 DRAB 38. An NMI to the processor 28 occurs. The ES and PORT PROCES- 
SOR ARB DIS bits are also set. Write 1 to clear the error on the Cache B DRAB 114 CSR first BEFORE 

writing 1 to clear this bit in the Master CONTROLLER MODULE 10 DRAB 38. 

45 CSR CACHE TIMEOUT - Bit: <26> Type: R/W1C 

Timeout on CDAL bus 26 access to cache. 

CSR KILL - Bits: <30> Type: RW 

50 

CSR ES - Bit: <3I> Type: RW 

This bit is a summary of most of the errors in the CSR register. It is set when the following CSR errors 

occur: WDAT PERR, ID PERR, ADDR PERR, NXM, IN_BUSTO, MBIT ERR, NB_TEA. When this bit 

55 is set, an interrupt to the processor 28 occurs. 

NOTE- The interrupt is a Non-Maskable Interrupt when the error occurred during an IBUS 19 cycle. When 
the error occurred during an NBUS 21 cycle or a DRAM 98 refresh, a regular interrupt to the processor 28 
is asserted. 

28 
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DRAB 38 DIAGNOSTIC REGISTER (DCR) (R/W) 

This register must be written with a LONGWORD operation on a LONGWORD boundary. 
5 DCSR BG<7:0> - Bits: <7:0> Type: R 

This field provides observability of the Bus grant lines. 



10 



Table 20: DRAB 38 Diagnostic GSR 
Register bit definitions 



DRAB base + 

332222222222111111111 1 

10987654321098765432109876543210 

75 I I I I I I I I I 

-+-+-+-+-+-+ 

08 | 1 DIAG CKBITS | FBPE I I I I I BR<7 : 0> | BG<7 : 0> | 

111 

| | * BB tost 

| ARB TEST 

• SBIT_CR_MASK 

— CACHE_ENABLE 

' SUBS CHKBIXS 



20 



25 DCSR BR<7:0> - Bits: <15:8> Type: RW 

This field provides observability of the Bus request lines, and when DCSR:ARBTEST is set, writes to 
this register provide the pattern applied to the arb logic. 

30 DCSR BB TEST - Bit: <16> Type: RW 

This bit provided observability of the NBUS 21 Bus Busy (BB L) signal. When DSCR:ARBTEST is set, 

writes to this bit provide the value the arb logic sees for BB L. 

35 DCSR ARBTEST - Bit: <17> Type: RW 

Setting this bit puts the DRAB 38 in ARBTEST mode. In this mode the NBUS 21 Bus request signals 
are not seen by the DRAB 38/NBUS 21 arbitration logic. The arbitration logic is stimulated by writes to the 
BR 166 and BB fields in this register. The resulting BG 164 signals can be observed by reading the BG 
40 field. 

Care must be taken to hold the NBUS 21 masters reset during arbitration testing, as the Bus grant 
signals will be driven out of the DRAB 38. 

DCSR SBIT_CR_MASK - Bit: <I8> Type: RW 

45 

If a single bit error and correction to memory occurs (CSR:SBIT_CRRCTN = 1 ) when this bit is set, an 
interrupt to the processor will occur. This is to allow further investigation if multiple SBE's are happening in 
a short time period. If the bit is not set, then when CSR:SBIT__CRRCTN = 1, there will be no interrupt. 

50 DCSR CACHE ENABLE - Bit: <I9> Type: RW 

When this bit is set, cache is enabled. It is initialized to zero, i.e., cache disabled. 

DCSR FBPE<3:0> - Bits: <23:20> Type: RW 

55 

Force Byte Parity Error. Each of these bits, when set will cause a corresponding byte parity error on a 
memory read. Bit 0 will cause a parity error in byte 0, etc. 
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DCSR DIAG_CHKBITS<6:0> - Bits: <30:24> Type: RW 

These bits can be written with any value to be substituted into the normal checkbits in the ECC. They 
will be used for the normal checkbits when SUBS__CHKBITS is set. They are ignored if SUBS-CHKBITS is 
5 not set. These can be use to force single and multiple bit errors in the memory. 

DCSR SUBS_CHKBITS - Bit: <3I> Type: RW 

When this bit is set the value in DIAG CKBITS will be used for the checkbits in ECC. This can be 

70 used to force single and multiple bit errors in the memory. 

DRAB 38 DRAM BUFFER 30 ERROR REGISTER (DER) ( R ) 

Table 21: DRAB 38 Diagnostic Error 
15 Register bit definitions 

DPAB b*s« - 

3322222222221111111111 

iO9B*76543210987 65432 10987 6543210 
i I I I ! 1,1 I I 

20 OC |x|x|X|x|X|XiX|XlX|x|X|X)x|z| I I SBJSYNDROME | | MB_SY1TOR0ME J 

— + | , + , 

3 j M_R£FH_IN_P ROG 

| ' S REFR_IN_PROG 

BNK_ERR_IDENT<1 : 0> 

25 

This register must be written with a LONGWORD operation on a LONGWORD boundary. It is initialized 
to zero. 

30 DER MB_SYNDROME<6:0> - Bits: <6:0> Type: R 

This field holds the syndrome value when a multiple bit error has occurred, and will only be valid when 
the CSR:MBIT ERR bit is set. Any value read when the CSR:MBIT ERR bit is not set is not valid. 

35 DER M REFR IN PROG - Bit: <7> Type: R 

When set, this bit indicates there was a refresh in progress when a multiple bit error occurred. It will 

only be valid when the CSR:MBIT ERR bit is set. Any value read when the CSR:MBIT ERR bit is not set 

is not valid. 

40 

DER SB_SYNDROME<6:0) - Bits: <13:7> Type: R 

This field holds the syndrome value when a single bit error and correction has occurred, and will only 
be valid when the CSR:SBIT_CRRCTN bit is set. Any value read when the CSR:SBIT_CRRCTN bit is not 
45 set is not valid. 

DER S_REFR_IN_PROG - Bit: <I5> Type: R 

When set, this bit indicates there was a refresh in progress when a single bit error and correction 
so occurred. It will only be valid when the CSR:SBIT_CRRCTN bit is set. Any value read when the 
CSR:SBIT_CRRCTN bit is not set is not valid. 

DER BNK ERR IDENT<1 :0> - Bits: <17:16> Type: R 

55 This field identifies the DRAM 98 bank where a memory error occurred. The data will only be valid 
when the CSR:SBIT__CRRCTN bit is set. Any value read when the CSR:SBIT_CRRCTN bit is not set is 
not valid. 



30 



BNSDOCID: <EP 0632379A2J_> 



EP 0 632 379 A2 



DRAB 38 ERROR ADDRESS REGISTER (EAR) ( R ) 

Table 22: DRAB 38 Error Address 
Register bit definitions 



DRAB bas« 4 

3322222222221111111111 

1O98765432109B765432109676543210 

I I I I I ! I I I 

+-+-+-+-+-+-4-+-+-+-+-+-+-+- + -+-+-+- + +-+-+-+-+-+-+--*— + 

10 |0 10] 0| 0| ADDRESS |0|0| 



This register must be written with a LONGWORD operation on a LONGWORD boundary. 

15 

EAR ADDRESS<25:0> - Bits: <27:2> Type: R 

This register will contain the address of the error which occurred to set the CSR:ES bit and interrupt the 
processor 28. It will contain the address of the first error if multiple error bits are set in the CSR. The 
20 address does not include the region or the byte values. 

DRAB 38 ERROR DATA REGISTER (EDR) ( R ) 

Table 23: DRAB 38 Error Data Register bit definitions 

25 



DRAB bnaa + 

3322222222221111111111 

1098765432109876543210967654 3 210 

I I I I I I III 

+-+-+-+-+-+-+-+- + -+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ 

14 | DATA | 

30 +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ 



This register must be written with a LONGWORD operation on a LONGWORD boundary. 

35 

EDR DATAOkO) - Bits: <3f:0> Type: R 

This register will contain the data of the error which occurred to set the CSR:ES bit and interrupt the 
processor 28. It will contain the data of the first error if multiple error bits are set in the CSR. 

40 

DRAB 38 ERROR REGION REGISTER (ERR) ( R ) 

This register must be written with a LONGWORD operation on a LONGWORD boundary. 

45 EDR NADR RGN<3:0> - Bits: <3:0> Type: R 

This register will contain the upper four address bits of the NBUS 21 address when an error occurs 
(CSR:ES bit is set). If the CSR:IBUS 19__CYCLE bit is NOT set, then the transaction was on the NBUS 21 
and this field is valid. If the CSR:IBUS 19__CYCLE bit IS set and the transaction was an IBUS 19 to NBUS 
50 21, this field is valid. The field is not valid on just an IBUS 19 transaction error. 

Table 24: DRAB 38 Error Region 
Register bit definitions 

55 — — 
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DRAB bast + 

3322222222221111111111 ft 
109876543210987654321098763 4^3 2 10^ 

J — H — 1^— -I — I- — f K-J^ — — — H I — * I — -t~-» — -f-— -» — ' | « 1 ' 

18 |x|x|x|x|x|x|x|xlx|x|x|x|x|xtxjx|x|x|x|x|x|x|x|x| I 1 

! I 

| * — NADR RGN 
IADR~RGN 



m XADR^^ rour addXMfl bit- or th . IB os 19 wh-n an .rror 

occure <CSR:ES bit i« ««t) . If th« CSR:IBUS 19_CYCLE bit IS 8«t, th«n thtt •rror 

Table 25: DRAB 38 Region Setup 
Register bit definitions 



7 5 DRAB bfise + 

3322222222221111111111 

1O98765432109876543210987654321O 
| | J I I I I I 1 

10 |X|X|X|X| CABCSRJ CAACSR|x|X|X|x|CACHEB I CACHEA | I TON | BUFFR | 

transaction was on the IBUS 19 and this field is valid. 

20 

DRAB 38 REGION SETUP REGISTER (RSR) ( RW ) 

25 This register must be written with a LONGWORD operation on a LONGWORD boundary. Due to the 
many region-to-region constraints, bus specific region constraints as well as module hardware constraints 
that must be adhered to when programming this register, the power-up default register value should 
generally be used. 

30 SETUP BUFFER<3:0> - Bits: <3:0) Type: RW 

These bits define the region to access buffer memory. They are the upper 4 bits of the address driven 
by the processor 28 bus devices - IADDR<31 :28). However, they are only 3 of the 4 upper bits of the 
address driven by the port processor 32 bus devices - NADDR<30:28). NADDR<31) is used by the FX chip 
35 54 for decoding RAID XOR functions. It is initialized to 2 (0010). 

SETUP ITON<3:0> - Bits: <7:4> Type: RW 

This is the upper 4 bit address region that the processor 28 uses to address the port processor 32 bus. 
40 It is initialized to 3 (0011). 

SETUP CACHEA<3:0> - Bits: <11:8> Type: RW 

These bits define the region to access CACHE A. They are the upper 4 bits of the address driven by 
45 the processor 28 bus devices - IADDR<31 :28). However, they are only 3 of the 4 upper bits of the address 
driven by the port processor 32 bus devices - NADDR(30:28>. NADDR<31> is used by the FX chip 54 for 
decoding RAID XOR functions. It is initialized to 4 (0100). 



50 



SETUP CACHEB<3:0> - Bits: <15:12> Type: RW 

These bits define the region to access CACHE B. They are the upper 4 bits of the address driven by 
the processor 28 bus devices - IADDR<31 :28>. However, they are only 3 of the 4 upper bits of the address 
driven by the port processor 32 bus devices - NADDR<30:28>. NADDR<31> is used by the FX chip 54 for 
decoding RAID XOR functions. It is initialized to 5 (0101). 



55 
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SETUP CAACSR3:0> - Bits: <23:20> Type: RW 



These bits define the region to access CACHE A CSR. They are the upper 4 bits of the address driven 
by the processor 28. Port processor 32 devices can not access CACHE CSRs. It is initialized to 8 (1000). 



SETUP CABCSR<3:0> - Bits: <27:24> Type: RW 



These bits define the region to access CACHE B CSR. They are the upper 4 bits of the address driven 
by the processor 28. Port processor 32 devices can not access CACHE CSRs. It is initialized to 9 (1001). 



DRAB 38 l/D CACHE 44 HIT COUNTER (RSR) ( RW ) 

Table 26: DRAB 38 I/D HIT COUNTER 
definition 



DRAB bass + 

3322222222221111111111 

10997654321098765432109876543210 
I I I I I I i I I 

20 | I/D HIT COUNT <31:0> I 



This register must be written with a LONGWORD operation on a LONGWORD boundary. It is initialized 
25 to zero. 



IDH(31:0> - Bits: <3I:0> Type: R 



This is a 32 bit loadable counter which counts the number of I/D cache 44 bits for the type of 
30 information being cached (Code, Data or both) while the I/D cache 44 is enabled. Terminal count 
(FFFFFFFF) of either this counter or the I/D MISS COUNTer will cause both counters to freeze at their 
current value. In order to reset the I/D cache 44 counters after either has reached terminal count, this 
counter should be written with a reset value before the I/D MISS COUNTer is written. 



35 DRAB 38 I/D CACHE 44 MISS COUNTER (RSR) ( RW ) 



Table 27: DRAB 38 I/D MISS COUNTER definition 



DRAB bae« + 

3322222222221111111111 

lO98765432109876543210»87654321O 

I I I I I I I I I 

+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+- + -+-+-^+-+-+-+-+-+-4— + 

20 | I/D MISS COUNT <31;C> I 



45 

This register must be written with a LONGWORD operation on a LONGWORD boundary. It is initialized 
to zero. 

50 IDM<3I:0> - Bits: <3I:0> Type: R 

This is a 32 bit loadable counter which counts the number of I/D cache 44 misses for the type of 
information being cached (Code, Data or both) while the I/D cache 44 is enabled. Terminal count 
(FFFFFFFF) of either this counter or the I/D HIT COUNTer will cause both counters to freeze at their current 
55 value. In order to reset the I/D cache 44 counters after either has reached terminal count, the I/D HIT 
COUNTer should be written with a reset value before this counter is written. Writing this register causes 
both counters to start counting again. 
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There are a total of three arbitration functions within the DRAB 38 chip. There is arbitration between 
IBUS 19 and the NBUS 21 for access to the DRAM buffer 30. There is arbitration between the IBUS 19 and 
the NBUS 21 for access to the read cache memory. And there is arbitration between the 6 SCSI ports, Host 
Port, FX chip 54 and the processor 28 for access to the NBUS 21 bus. 

5 

BUFFER MEMORY ARBITRATION 

Priority of accesses to DRAM buffer 30 by the IBUS 19 and the NBUS 21 is equal. If both are 
requesting repeated access to the buffer memory, access will be granted to each alternately. 

10 

READ CACHE MEMORY ARBITRATION 

The NBUS 21 requests have priority for access to read cache memory over the IBUS 19. 
75 NBUS 21 

NBUS 21 devices have the following priority: 

Table 28: NBUS 21 

20 



Host port (highest priority) 

IBUS 19 to NBUS 21 accesses 

SCSI port 1 

SCSI port 2 

SCSI port 3 

SCSI port 4 

SCSI port 5 

SCSI port 6 

FX chip 54 (lowest priority) 

Devices 3-9 always arbitrate in a round-robin fashion. Devices 1 and 2 can be individually programmed to 
participate in the round-robin. They can also be programmed to win via a simple priority and not participate 
in round-robin with the other requestors. 

35 

SLAVE/DATA CACHE mode 

The DRAB 38 chip operates in both master and slave mode. Master mode is for Controller module 10 

applications. The slave mode is for cache module 110 110. The SLAVE H input sets the mode for the 

40 DRAB 38/1 14 chip. On controller modules 10, this pin is grounded, on cache modules 110, this pin is pulled 
to a logic "1". 

The DRAB 38 chip controls the transfer of data over the backplane between controller module 10 and 
cache module or modules 110. The backplane interconnect is the CDAL bus 26. The CDAL bus 26 is a 32 
bit multiplexed data and address bus with odd byte data parity. Each controller module 10 has a single 

45 CDAL bus 26 attached. Each cache module 110 has two CDAL busses 26A and 26B attached, one from 
each controller module 10A and 10B. Each controller module 10 has access to either of the two cache 
modules 110. Arbitration is provided by the slave DRAB 114A and 114B. 

In the master mode, the controller module 10 DRAB 38 decodes IBUS 19 and NBUS 21 requests, and 
initiates CDAL bus 26 transfers on behalf of the controller module 10 bus master. The cache module 110 

50 DRAB 114 receives the requests, performs the DRAM buffer 30 or CSR access requested and asserts an 
ACK or ERRor response. 

DRAB 38 CDAL bus 26 control signals 

55 CREQ L<1:0) - CDAL bus 26 transfer request/address latch Driven by the controller module 10 DRAB 

38 chip to initiate a CDAL bus 26 transfer. The line remains asserted during the transfer. At the end of the 
transfer, de-assertion of this signal terminates the transfer. There is one "req" for each cache module 110, 
from each controller module 10. The controller module 10 DRAB 38 decodes the region and asserts the 

34 
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proper request. 

CACK_L<1:0> - CDAL bus 26 transfer ack 

5 Driven by cache module 110 slave DRAB(s) 114. This signal is pulsed low once for each data longword 

transferred on the CDAL bus 26. 

CERR__L<1:0> - CDAL bus 26 transfer error 

io Driven by cache module's 110 slave DRAB(s) 114. Received by the controller module 10 DRAB 38. 

Cache module 110 CDAL bus 26 error, and cache module 110 mem error signal. One error signal from 

each of the two cache modules 110. This signal is asserted with the CACK L signal to indicate an error 

has been detected by the cache module 110 DRAB chip 114. The controller module 10 DRAB 38 then 
performs the proper error response sequence for the NBUS 21 and/or IBUS 19 master. 



15 



Table 29: BASIC CDAL bus 26 transfer 



CDAL -c.*ddr*«t >—c data 0 >-< data 1 

on CRE0 L 

\ / 

CACK L 



50 



55 



25 Other CDAL bus 26 control signals 

The following signals are required on the backplane between the controller module 10 and cache 
modules(s) 110. 

30 0 PRESENT A and C PRESENT B 

Backplane lines from each cache module 110 to each controller module 10. These lines are pulled up 
on each controller module 10. IF a cache module 110 is inserted in cache slot A, C_PRESENT__A is 
pulled down, indicating presence of the cache module 110 in the system to each controller module 10. 
35 These lines are received by the controller module 10 diagnostic read registers 80,82. 

C_LOCK <1:0> - Cache lock signals 

Driven by the controller module's 10 DIAG control register as a cache module 110 lock signal. A 
40 controller module 10 can "lock" a cache module 110 for its exclusive use. Assertion of the C__LOCK signal 
will inhibit a cache module 110 DRAB 114 from responding to a CREQ from any other controller module 10 
CDAL bus 26 master. A controller module 10 attempts to take ownership of a cache module 110 by 
asserting this line. If a subsequent transfer receives ACK, the lock is in place until this line is de-asserted. If 
the transfer times out, the cache module 110 is either locked to the "other" controller module 10 or not 
45 functional. 

If no C_J_OCK signals are asserted to a cache module 110, it will respond to either controller module 
10 master. 



INTERFACE SPECIFICATION 

The following information describes the electrical and timing interface requirements for the DRAB 38 
chip. 
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DRAB 38 PIN DESCRIPTIONS 

Table 30: 



MBUS 24 and DRAM 


98 INTERFACE SIGNAL PINS 


Signal 


Type 


#pins 


Total 


Description 


MDAL <31 :00>H 


I/O 


32 


32 


MEMORY DATA/ADDRESS BUS 


MDP <3:0>H 


I/O 


4 


36 


DATA PARITY 


MECC <6:0>H 


I/O 


7 


43 


ECC check bits 


RAS <3:0>L 


0 


4 


47 


Row Address Strobe 


CAS <3:0>L 


0 


4 


51 


Column Address Strobe 


DADDR <11:0>L 


0 


12 


63 


DRAM Address Lines 


WE L 


0 


1 


64 


Write Enable 


BCLK H 


I 


1 


65 


Bus Clock 



MDAL (31:0) H - Memory Data/Address Lines; 32 Pins; I/O 
20 The MBUS 24 is a data/address multiplexed memory bus. The DRAM buffer 30 data pins, DRAB 38, 

and the bus exchanger 100 chips tie directly to this bus. 
DP <3:0> - NBUS 21 Data Parity Lines; 4 Pins; I/O 
Odd byte parity lines for the MBUS 24. 

25 NBUS 21 Port Interface Signal Pins 

The following device signal pins attach to the NBUS 21 Port control signals. 

Table 31 

30 



35 



40 



NBUS 21 Port Signal Pins 


Signal 


Type 


#pins 


Total 


Description 


SIZ <1:0> H 


I/O 


2 


67 


NBUS 21 


Transfer SIZe 


NADDR <31:28> H 


I 


4 


71 


NBUS 21 


Region Address 


TS L 


I/O 




72 


NBUS 21 


Transfer Start 


TIP L 


I/O 




73 


NBUS 21 


Transfer in Progress 


RW L 


I/O 




74 


NBUS 21 


Read/Write L control 


TA L 


I/O 




75 


NBUS 21 


Transfer Acknowledge 


BB L 


I/O 




76 


NBUS 21 


Bus Busy 


BR <7:0> L 


I 


8 


84 


NBUS 21 


Bus Reqs./Cache Sel. 


BG <7:0> L 


0 


8 


92 


NBUS 21 


Bus Grant 


ARBWIN (2:0> H 


0 


3 


95 


NBUS 21 


Bus Master 



45 

SIZ <3:0> H - NBUS 24 SIZe Lines; 2 Pins; I/O 

Indicate the length of a bus transaction. The FX chip 54 only accepts LONGWORD slave accesses 
(SIZ =00) and performs only BURST DMA transfers (SIZ = 11). All bus transfers with SIZ other than 00 and 
1 1 are ignored. 
50 TS L - NBUS 24 Transfer Start Line; 1 Pin; I/O 

NBUS 24 TS/ Line. This line indicates that a transfer is starting and valid address and control lines are 
asserted on the bus. 

TIP L - NBUS 24 Transfer in Progress Line; 1 Pin; I/O 
NBUS 24 TIP/ Line. 
55 BR <7:0> H - NBUS 24 Bus Request/Cache Request; 2 Pins; I/O 

In controller mode, these lines are the inputs to the arbitration logic. 

In cache mode, BR<0> and BR<1> serve as the two requests for access to cache memory behind the 
DRAB chip. 

36 
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Processor 28 Port Interface Signal Pins 

Table 32 

5 



Processor 28 Port Signal Pins 


Signal 


Type 


#pins 


Total 


Description 


BE <3:0> H 


I 


4 


99 


byte enable lines 


BLAST L 


I 


1 


100 


Burst Last Signal 


DEN L 


I 


1 


101 


data enable 


ADS L 


I 


1 


102 


Address Strobe 


IRQ L 


O 


1 


103 


Interrupt Req. 


READY L 


O 


1 


104 


Ready Signal 


DC L 


I 


1 


105 


Data/Code Signal 


IADDR<31:28) L 


I 


4 


109 


Region Address 


MATCHA H 


I 


1 


110 


l/D Cache 44 Hit Bank A 


MATCHB H 


I 


1 


111 


l/D Cache 44 Hit Bank B 


LRUWL 


o 


1 


112 


l/D Cache LRW Write Line 



Cache Bus Interface Signal Pins 



25 

Table 33 



Cache Bus Interface Signal Pins 



Signal 


Type 


#pins 


Total 


Description 


CRW L 


I/O 


1 


113 


Cache Bus Read/Write Cntrl 


CREQ <1 :0> L 


I/O 


2 


115 


Cache Access Request 


CACK<1:0> L 


I/O 


2 


117 


Cache Access Acknowledge 


CERR<1:0> L 


I/O 


2 


119 


Cache Error Acknowledge 



CREQ L - Cache Bus Access Request; 2 Pins; I/O 

In controller module 10 master mode, this pin is an output only. This pin is asserted to start a transfer 
to a cache module 110. Just prior to asserting this signal, the cache address is placed on the CDAL bus 26 
40 lines along with C RW L. 

In cache module 110 slave mode, this pin is an input only. It receives the transfer request from the 
master controller module 10. 

CACK L - Cache Bus Access Acknowledge; 2 Pins; I/O 

In controller module 10 master mode, these pins are inputs only. The acknowledge handshake is 
45 received through this pin. 

In cache module 110 slave mode, this pin is an output only, and drives the acknowledge handshake 
signal. 



50 



55 
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Miscellaneous interface signal pins. 

Table 34 

5 



75 



Miscellaneous Interface Signal Pins 


Signal 


Type 


#pins 


Total 


Description 


MODE L 




1 


119 


Cache Slave/Cntrir Master 


MEMRW L 


1 


1 


120 


FX chip 54 R/W Cntrl pin 


XCVR <3:0> H 


0 


4 


124 


Bus Transceiver Control 


LOCK L 


0 


1 


125 


Buffer Memory LOCK 


CS L 


1 


1 


126 


DRAB 38 Chip Select 


RESET L 


1 


1 


127 


Master Chip Reset 



CS L - DRAB 38 Chip Select; 1 Pin; I 

The DRAB 38 internal registers can be read and written by the processor 28. Assertion of this chip 
select pin tells the DRAB 38 chip it is the target of a processor 28 transaction. The DRAB 38 chip performs 
20 the requested transaction and asserts READY L 

Referring additionally now to Fig. 5, a functional logic block diagram of the FX chip 54 is shown. All 
internal registers of FX chip 54 are accessed through the NBUS 21. FX chip 54 comprises a command and 
XPAGE ("PCX") registers 200 under control of port processor bus control logic 202 and three bits on 
MPORT lines 222. 

25 DMA registers 204 are also coupled to NBUS 21 providing a DMA indirect list pointer ("DILP") address 

register, a DMA page address register, DMA next page register and a DMA command register. DMA 
registers 204 in conjunction with PCX registers 200 provides an address ("XADDR") address on XADDR 
bus 206. DMA registers 204 are controlled by DMA control logic 214 which also supplies an input to 
multiplexer 210. 

30 NBUS 21 is coupled through bus data latch 216 to one input of an XOR logic function 218 having its 
other input coupled to the output of multiplexer 210 has as its input a logical "zero" input 212 as well as the 
output of XOR logic function 218 appearing on bus 220 through data latch 208. Bus 220 provides 32 bits of 
XORed data and four bits of XORed data parity. 
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FX Chip 54 CSR register 

Table 36: FX Chip 54 Control/Status Register (CSR) 

5 

FX base + 

+ + + + + 

20 | Control and Status I RW 

+ + + + + 

70 31 . 2 4 2 3 1 6 1 5 0 8 0 7 0 0 

DMA engine registers 

75 

Table 37: DMA Address registers (DADDR) 



25 



35 



40 



20 


JO I 


♦ - — - — - - - - 4. , — 

CMA. XnOlrvot List Vmqm Ac 




t Li»t orr»»t|0|oi 


RW 


— — f 






32 24 


23 


14 IS 


00 


07* 
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lOTTCR PKCt ADDRESS 


1 


VHX XPACX 


1 


R 






31 24 


23 


14 13 


08 


07 



00 



Table 38 : DMA command/Control register (DCMD) 



30 38 I CMD I OKA. INDLX COUNT I DMA. PACE COUNT | RW 

31 24 23 14 15 08 07 00 



REGISTER DESCRIPTIONS 

FX CHIP 54 PORT COMMAND and XPAGE REGISTERS (PCX) (RA/V) 

Table 39: FX CHIP 54 PORT COMMAND and XPAGE 
Register bit definitions 



rx baas + 00 through 1C, or rx baa* + 24 + ARB WIN 
3322222222221111111111 

10987654321096*765432109676543210 
I I I I I I I I I 

+ ■♦-+-« I +-+-+— 4 - i - k i +-< r I I t +-4—+-4 - + 

dt . |X|X!X|X|X|X|X|X|X|X|X|X|X| XPAGE | X | X } X | X | X \ 0 | CMD | 

I t I 

| | *-XOR COMMAND 

I I 
* — XBOT 102 PAGE ADDRESS 

50 

The FX chip 54 performs on the fly XOR for up to eight bus masters. There is one PCX register for 
each bus master. The currently selected bus master is communicated to the FX chip 54 through the three 
ARBWIN 168 signal pins. The decoded ARBWIN 168 value selects the corresponding PCX register. 
55 The X field can be written with any value, written values are returned on read. 
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PCX CMD - Bits<1 :0> Type: RW 

CMD<0> controls read-XOR-write operation. If this bit is SET, DATA transfers cause read-XOR-write 

operations to XBUF 102 to occur. 
5 CMD<1> controls the buffer memory Read/Write line. When this bit is set, buffer memory Read 

operations become XBUF 102 read operations with simultaneous buffer Write operations. This allows the FX 

chip 54 to transfer computed XOR blocks to a PORT while at the same time copying them to buffer 

memory, at the address specified by the port. 

If both CMD<0> & CMD<1) bits are set, data from the XBUF 102 will be transferred to a PORT and buffer 
10 memory as described above. In addition, since the XOR bit is also set, an XOR of XBUF 102 data with 

itself, or "0", is written back to the XBUF 102 location being accessed. This serves to clear the XBUF 102 

page in preparation for another operation. CMD<2> Controls the calculation of a running 'OR' for buffer 

compare operations. If this bit is set, a running 'OR* of all bit XOR results, is calculated. If this bit is reset, 

the running 'OR' is disabled, but the OTF Buffer Compare flags are not cleared. If CMD = 100#2, On-The- 
75 Fly XOR operations will occur but XBUF 102 will not be updated. CMD = 101#2 will cause the Read-XOR- 

Write Operations to occur, with running 'OR* for buffer compare. The resulting XBUF 102 data will be a bit 

map to all differences between the data buffers. 

If XBUF 102 slave address space is written by the port controller, and NADDR<31> = 1 (XOR operations 

enabled), The location referenced by the port is not updated. This allows for port OTF Buffer Compare 
20 operations which do not involve Buffer memory. These transfers complete faster than buffer memory 

references. A zerofill operation does NOT put the read data addressed by the PCX register on the NBUS 

21 ; it only zero's the address in XRAM. 

If CMD = 0, no operation occurs. The XBUF 102 memory is not modified. 

25 PCX COMPARE CLEAR - Bit<3> Type RAW1 C 

Writing 1 to this bit location clears the corresponding OTF buffer compare result bit in the FX chip 54 
CSR register. On PCX register read from the SCSI port, this bit reflects the same status as the CSR OTF 
buffer compare result bit. 

30 On a processor 28 read of any PCX register besides the PCXO, the COMPARE CLEAR bit will not 
reflect the correct status. The correct status of each PCX OTF buffer compare can be found only in the 
CSR for processor 28 accesses. The port processor 32 read of its corresponding PCX register will reflect 
the correct status in the COMPARE CLEAR bit. 

35 PCX XPAGE address - Bits<18:9> Type: RW 

This value is used as the page address into the XBUF 102 memory for read-XOR-write operations. 

FX CHIP 54 CONTROL and STATUS REGISTER (CSR) (R/W) 

40 

This register must be written with a LONGWORD Operation on a LONGWORD boundary. 
CSR CHIP REVISION - Bit<2:0> Type: R 
45 This field is reserved to hold the chip revision number. 
NBUS 21 PARITY CHECK DISABLE - Bit<3> Type: R 

This bit, when set, disables parity checking on the NBUS 21 for slave writes to the PCX registers of the 
so XBUF 102. When set this bit inhibits the NBUSPORT logic from asserting the parity error signal. This allows 
data with bad parity to be written to FX chip 54 registers or the XBUF 102 memory. 
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Table 40: FX Chip 54 CSR Register bit definitions 



Initialized to: 

000OO00O000OOOOO0OOOOO00OO0OOO1O 
FX baae + 

3322222222221111111111 

10987654321O987654321O9fi7654321O 



10 



ill! 
+-+-+-+-+-- f-+-+-+-+-+- +-+-+« 
I I I I I I I I I I I I I I 



TIMEOUT 



75 



20 



25 



I 



-+-+- 



-4.-+. 



| | | | |PORT | | REV | 



I 



xbut 



I i 

I * -CHIP REVISION 
| --NBUS 21 PARITY CHECK 
| DISABLE 

ERROR PORT 4 

ERROR SUMMARY 

XBUT 102 SLAVE PARITY ERROR 

NBUS 21 SLAVE PARITY ERROR 

NBUS 21 TIMEOUT ERROR 

102 PARITY ERROR 

NBUS 21 MASTER PARITY ERROR 

NBUS 21 TIMEOUT VALUE 

DMA. BUTTER COMPARE 

DMA ACTIVE 

DMA COMPLETE 

INTR 

INTR ENABLE 

OTT BUTTER COMPARE 

OTT BUTTER COMPARE 

OTT BUTTER COMPARE 

OTT BUTTER COMPARE 

OTT BUTTER COMPARE 

OTT BUTTER COMPARE 

OTT BUFFER COMPARE 

OTT BUTTER COMPARE 



30 CRS ERROR PORT - Bit<6:4> Type: R 

When an error is detected, the number of the controller module 10 NBUS 21 master at the time of the 
error is latched in this field. 

The value in this field is valid only when the ERROR SUMMARY bit is SET. 

35 

CSR ERROR SUMMARY - Bit<7> Type: R 

This bit offers a quick summary of chip error status. This bit is the logical OR of all error bits. This bit 

reflects the state of the ERR L pin. This signal can be used to inhibit the NBUS 21 arbitration logic, 

40 allowing the processor 28 control processor to access the chip before error state is lost. 

CSR XBUF 102 SLAVE PARITY ERROR - Bit<8> Type: W1C 

When the FX chip 54 XBUF 102 port logic detects a parity error on XBUF 102 slave read operations, 
45 this bit is asserted. 

This bit is cleared when WRITTEN to one. 

CRS NBUS 21 PARITY ERROR - Bit<9> Type: W1C 

so When the FX chip 54 NBUS 21 port logic detects a parity error on FX chip 54 slave register writes, or 
XBUF 102 memory slave writes this bit is asserted. 
This bit is cleared when WRITTEN to one. 

CSR NBUS 21 TIMEOUT ERROR - Bit<10> Type: W1C 

55 

This bit is asserted when a DMA transfer initiated by FX chip 54 does not receive a transfer ack 
response from memory within the timeout period. This could be the result of a bad memory address, or 
hardware failure. By setting this bit, and ending the bus cycle, we allow the processor 28 access to collect 
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error status. 

This bit is cleared when WRITTEN to one. 

CSR XBUF 102 MASTER PARITY ERROR - Bit<11> Type: W1C 

5 

When the FX chip 54 XBUF 102 port logic detects a parity error on XBUF 102 read while FX chip[ 54 is 
the NBUS 21 master (DMA operations), this bit is asserted. 
This bit is cleared when WRITTEN to one. 

io CSR NBUS 21 MASTER PARITY ERROR - Bit<12) Type: W1C 

When the FX chip 54 NBUS 21 port logic detects a parity error on NBUS 21 read operations, while the 
FX chip 54 is the NBUS 21 master (DMA operations), this bit is asserted. 
This bit is cleared when WRITTEN to one. 



75 



20 



25 



30 



35 



40 



CSR NBUS 21 TIMEOUT VALUE - Bit<18:13> Type: RW 



This field specifies the NBUS 21 DMA transaction timeout value. This value sets the NBUS 21 timeout 
period. Eacn time the FX chip 54 starts a master mode NBUS 21 operation (DMA), the transaction timeout 
counter is loaded with this value. The transaction timeout counter is down counted at 1/4 the BCLK rate. If 
the counter reaches zero before a transaction acknowledge is received, The NBUS 21 TIMEOUT ERROR 
bit is set 

This field is initialized to zero which means there will be no DMA timeout check by the FX chip 54. This 
field needs to be initialized to a non-zero value in order for a DMA NBUS 21 timeout to occur. 

DMA BUFFER COMPARE - Bit<19> Type: W1 C 

This bit indicates the result of a DMA buffer compare operation. A BUFFER COMPARE operation is 
nothing more than a DMA XOR operation where the running "OR" of all the bits output from the XOR 
engine is computed. The result of this running "OR" is reflected in this register bit This bit is SET when a 
non-zero result is computed. When SET, the BUFFER COMPARE DMA is terminated. The state or the DMA 
PAGE ADDRESS register, and DMA INDEX REGISTER will point to the location of the difference. The 

IRQ L pin is also asserted. 

This bit is cleared when WRITTEN to one, or when a new buffer compare DMA is started. 

CSR DMA ACTIVE STATUS - Bit<20> Type: R 

When this bit is set it indicates the FX chip 54 internal DMA engine is currently busy. When asserted, 
the DMA COMMAND and DILP registers must not be written. 

CSR DMA COMPLETE STATUS - Bit(21> Type: W1C 

When this bit is set it indicates the FX chip 54 internal DMA engine has completed a DMA operation. 
This bit is not asserted when an error occurs during a DMA operation. DMA Complete will be asserted when 
45 a DMA buffer compare operation terminates, even if the termination is a result of a buffer miscompare. 
This bit is cleared when WRITTEN to one. 

CSR INTR STATUS - Bit<22) Type: W1C 

so This bit reflects the state of the IRQ L pin. When SET, an interrupt request is being asserted to the 

processor 28. 

The FX chip 54 generates an interrupt, if enabled, when any of the ERROR bits in this register are set, 
and on DMA completion status. 

This bit is cleared when the error causing the interrupt is cleared. This will also reset the interrupt 

55 request to the processor 28. 
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CSR INTR ENABLE - Bit<23> Type: RW 

Setting this bit enables the IRQ L line to the processor 28. If reset, the IRQ_L line is always reset. 

This bit also disables the INTR STATUS bit from being set on an error. 

5 

OTF BUFFER COMPARE - Bits<31:24> Type: R 

These bits indicate the result of On-The-Fly (OTF) buffer compare operations. An OTF buffer compare 
operation is initiated when a PCX register is written with a Buffer Compare command. The port interface 
70 chip (port processors 32,34) then performs a normal data transfer to or from buffer memory. The FX chip 
54 computes a running 'OR* of all bit results from an On-The-Fly XOR of the data transferred, and the 
XBUF 102 XPAGE specified in the PCX register. If SET following an OTF Buffer Compare operation, a 
difference was detected. 

Bits in this field are read only status bits. Control/reset is contained in the associated PCX register. 

75 

FX CHIP 54 DMA INDIRECT LIST POINTER REGISTER (DILP) (RA/V) 



Table 41: DMA Indirect List Pointer register (DILP) 



3322222222221111111111 

10967654321096763432109876543210 
II I I I I I II 

30 I DMA Indirect List Pag« address | Li*t ottmmt 1 0 1 0 1 Wf 



DMA Indirect List Page address - Bits:<31:9> type: RW 

30 

The value in this field specifies the page address from which the list of DMA page address values will 
be read. 

List Offset - Bits:<8:2> Type: RW 

35 

This list offset can be written with an initial offset into the DMA Indirect List Page address. Following 
every page address value fetch, this value is incremented to point to the next LONGWORD in the list. All 
address values written to the DILP register must be LONGWORD aligned values. The list cannot cross page 
boundaries. The List Offset does NOT generate a carry into the List Page Address field. If a page crossing 
40 occurs, the List Offset will roll over to O's. No error will be flagged. 

FX CHIP 54 DMA BPAGE and XPAGE ADDRESS REGISTER (DADDR) (R) 



Table 42: FX CHIP 54 DMA PAGE ADDRESS 
Register bit definitions 



rx Usa + 

3322222222221111111111 

10587654321096765432109876543210 

II III till 

+ - +- +_ +- +- ^- +_+ 

34 | BUFFER PAGE ADDRESS 1 DMA XPAGE |R 



55 

DMA XPAGE - Bits:<9:0> Type: R 

This field holds the current DMA XBUF 102 page address value. 
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DMA BUFFER PAGE - Bits:<31:10) Type: R 

This field holds the current DMA BUFFER PAGE address value, bits 30:9. Table 43, FX CHIP 54 DMA 
PAGE ADDRESS shows the mapping if the page address fields to the NBUS 21 and XBUF 102 addresses. 

5 

Table 43: FX CHIP 54 DMA PAGE ADDRESS 



NBUS 21 buffer owory DMA addr»»e : 

3322222222221111111111 m m „ n 

10 10987654321098765432109076543210 

I I I I I I I I ■ 

+_+_+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+ 
|0 1 B OTTER PAGE ADDRESS I DMA XPAGE |0|Of 



15 



XBUT 102 DMA addr»a» r 



111111111 
87654321098 

I I t 

-+-+-+-+- 



76543210 
I I 

-+-+-+-+-+-+-+--»---+-+-+-+-«-— ►-+-+-+-+-+ 
| XBUT PAGE ADDRESS | DMA INDEX |0|0| 



20 These registers are not read/writable registers. They are created on a start of a DMA. NOTE that the 
actual Buffer Page Address is a one-bit right shift of the address that the operator loads into Buffer Memory 
(at the indirect DILP address). 

FX CHIP 54 DMA COMMAND REGISTER (DCMD) (R/W) 

25 

Table 44: DMA Command/Control register (DCMD) 



30 saaajaitiaiiiiiiiiiii* » t o 

i i 1 i ! ,, ,',,,, I 1 i , i ! 

! j 1 ' i -1M! tjuw orran 



35 

Writing this register with a non-zero DMA command initiates the DMA operation. The DILP register 
must have already been written with the starting address of the page list in buffer memory. 

40 DCMD LAST PAGE OFFSET - Bits:<8:2> Type: RW 

The value in this field sets the offset into the final page of a DMA operation at which the operation 
completes. The index written here will be the last longword of the DMA. If the DMA page count is ZERO, 
indicating final page, This value is compared against the DMA PAGE INDEX COUNTER value. If equal the 
45 DMA terminates. 



DCMD DMA PAGE COUNT - Bits:<15:9> Type: RW 

This field is written with the number of pages the DMA operation is to span. This number is 
50 decremented when the DMA is started, and again the start of each page. A value of ONE will cause the 
DMA operation to be restricted to a single page, TWO will transfer data from two pages etc. When the value 
is ZERO, the final page is being worked on. 

This register will not decrement past ZERO. If ZERO is written to this field, the DMA will span only a 
single page, exactly the same as if a value of ONE were written. 
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DCMD DMA INDEX COUNT - Bits:<24:18> Type: RW 

This field offers read access to the DMA engine's page INDEX counter. This value can be read for 
diagnostic testing, and to determine the index into a page where a buffer compare operation found a 
difference. 

DMA operations can begin at non-page aligned locations. Writing a non-zero value into this field will set 
an initial page offset into the first page of the DMA operation. 

DCMD CMD - Bits:<31 :28> Type: RW 

The CMD field determines the type of operation to be performed. 

Table 45 



DMA CMD Value 


CMD 


Transfer 


Description 




Type 




0000 


None ■ 


No operation, DMA does not start 


0001 


Read 


Copy Command, DMA from buffer memory 


0010 


Write 


Copy Command DMA to buffer memory 


0011 


Read 


XOR Command, DMA from buffer memory 






XOR with XBUF 102 page, result written back 






to XBUF 102 


0100 


Read 


BUFFER COMPARE Command, DMA from 






buffer memory XOR with XBUF 102 page 


1000 


Read 


ZERO Command, Performs NBUS 21 READ, 






ZEROs XBUF 102 page 


Note: The BUFFER COMPARE command does not write the resulting XOR back to XBUF 102. ZERO 


command NBUS 21 READ operations can address XBUF 102 space. This removes buffer memory cycle 


overhead from the transaction. 







ON-THE-FLY XOR 

To set up for XOR buffer generation, a single write to an FX Chip 54 PCXn register is required. Setting 
PCXn:CMD<0> will enable the XOR engine. Subsequent controller module 10 device bus 62 DATA 
transactions from a like numbered bus master will initiate Read-XOR-Write operations into XBUF. DATA 
transactions are all transfers in which NADDR<31) = 1. All Buffer Memory address space is thus double 
mapped. NADDR <31> is used to qualify all PCX commands. In the case of SCSI write operations the port 
processors 32 will write their PCXn register after target selection, but before it starts the write DATA DMA 
operations. 

In the case of SCSI disk read operations, with reselection, the port processor 32 writes its PCX register 
when reselected, before read DATA DMA operations occur. 

The FX chip 54 does not keep track of page index count, the port processors 32 provide the complete 
buffer address. Thus the FX chip 54 does not keep track of completion of buffer transfers. The only 
interrupt to the processor 28 from the FX chip 54 while performing on-the-fly XOR operations is in the event 
of error detection. 

The FX chip 54 can perform XBUF 102 read, buffer write, and XBUF 102 clear in a single memory 
access cycle. Setting the PCXn:CMD<0> is also set, an XOR of the XBUF 102 data with itself will be written 
back to the XBUF 102 location, thus clearing the location. 

If PCXmCMD = 011#2, and a reference is made to XBUF 102 address space, no memory write 
operation will occur. 
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DMA OPERATIONS 

The FX chip 54 has a single DMA engine. The engine reads buffer pointers from a list in buffer 
memory. The location of the list is defined by the value written into the FX chip 54's DMA Indirect List 
5 Pointer register. Once the DMA command register (DCMD) is written, the DMA engine performs a s.ngle 
LONGWORD "fetch" of the first pointer in the list. 

To allow for multi-page DMA operations, a method for chaining DMA's together is provided. When the 
page index counter ro.ls over (page crossing), and the DMA PAGE COUNT has not reached zero, a jingle 
LONGWORD DMA "fetch" operation is initiated. The buffer memory location pointed to by the DMA DILK 
,o register will be fetched and copied into the DMA Page Address register. Once the ^PA^ 0 ^"™ 88 
reached zero, no further DMA "fetch" operations are registered. Once the DMA LAST PAGE OFFSET is 
equal to the DMA INDEX COUNT (both found in the DCMD register), no further DMA's are requested. 

DMA operations can start on any LONGWORD boundary. The DMA INDEX COUNT field in the DCMD 
register can be written with an initial LONGWORD offset into the first page. DMA operations can stop on 
js any LONGWORD offset into the last page. 

ZERO FILL OPERATION 

The XBUF 102 can be zeroed, or cleared, by initiating a DMZ or a PCX operation. To zerofill XBUF 102 
20 with a DMA, a '1000' command is written into the DCMD register after the correct address es been set 
up in the DILP and Buffer Memory. The address can select Buffer Memory or XBUF 102. XBUF 102 
addressed operations complete faster. CSR DMA COMPLETE will be asserted when the zerofill finishes. 

To zerofill XBUF 102 with a PCX operation, a PCX command of '011' is written to the PCX reg.ster. 
This will clear the XBUF 102 memory pointed to by the XBUF 102 Page Address (the index is taken from 
25 the NBUS 21 address). 

DMA COPY OPERATION 

A command of '0001' written to the DCMD register will initiate a copy from Buffer Memory to XBUF 
102. A command of '0010' will initiate a copy from XBUF 102 to Buffer Memory. CSR DMA COMPLETE will 
be asserted when the DMA copy finishes. 

BUFFER COMPARE OPERATION 

A Buffer Compare can be initiated by either a DMA or a PCX operation. A DMA command of'0100* will 
compare data at the Buffer Memory and XBUF 102 addresses set up in the DILP indirect pointer. On a 
compare difference, CSR BUFFER COMPARE is asserted and IRQ_L is asserted. 

A PCX command of '100' will compare XBUF 102 data with the data currently on the NBUS 21 (through 
an NBUS 21 read or write). A command of '101' will do the compare plus it will also write the bit differences 
40 through to the XBUF 102. On a compare difference, PCX COMPARE CLEAR and CSR 0TF BUFFER 
COMPARE(x) are set. 

NBUS 21 PORT INTERFACE SIGNAL PINS 

45 The following device signal pins attach to the buffered port processor 32 bus (NBUS 21) port. The bus 
is Address/Data multiplexed. 
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50 
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Table 46: NBUS 21 PORT SIGNAL PINS 



10 



15 



Signal Type 


♦pins 


total 


Description 


DAL<31:00> 


HIO 


32 


32 


NBUS 


DP<3:0> H 


IO 


4 


36 


NBUS 


SI2<1:0> H 


IO 


2 


38 


NBUS 


TS L 


IO 


1 


39 


NBUS 


TIP L 


IO 


1 


40 


NBUS 


RW L 


IO 


1 


41 


NBUS 


TA L 


IO 


1 


42 


NBUS 


TEA L 


IO 


1 


43 


NBUS 


BB L 


IO 


1 


44 


NBUS 


CS L 


I 


1 


45 


NBUS 


XCS If 


I 


1 


46 


NBUS 


BCLK H 


I 


1 


47 


NBUS 


BREQ L 


O 


1 


48 


NBUS 


BGRT L 


I 


1 


49 


NBUS 


MEMWR L 




0 


1 


50 


ARBWIN<2:0> H 


I 


3 


53 


NBUS" 



Data/Address Bus 
Data Parity 
Transfer SXZe 
Transfer Start 
Transfer In Progress 
Read/WriteL Control 
Transfer Acknowledge 
Transfer Error Acknowledge 
Bus Busy 

Slave Chip Select 
XBUF Chip Select 
Bus Clock 
Bus Request 
Bus Grant 

Read/Writ eL Signal to Buffer Memory 
BUS Master Port 



DAL<31:0> H - NBUS 21 DATA/ADDRESS LINES; 32 PINS; I/O; 

Multiplexed 710 bus Data/Address pins. 
20 DP<3:0> H - NBUS 21 DATA PARITY LINES; 4 PINS; I/O; 

Odd byte data parity lines from the 710 bus. 
SIZ<1:0> H - NBUS 21 SIZe LINES; 2 PINS; I/O; 

Indicate the length of a bus transaction. The FX chip 54 only accepts LONGWORD slave accesses 
(SIZ = 00), and performs only BURST DMA transfers (SIZ = 11). All bus transfers with SIZ other than 00 and 
25 11 are ignored. 

TS L - NBUS 21 TRANSFER START LINE; 1 PINS; I/O; 

Buffered port processor 32 bus TS/ line. This line indicates that a transfer is starting and valid address 
and control lines are asserted on the bus. 

TIP L - NBUS 21 TRANSFER IN PROGRESS LINE; 1 PINS; I/O; 
30 Port processor 32 bus TIP/ line. 

TA L - NBUS 21 TRANSFER ACKNOWLEDGE; #1 PINS; I/O; 

Port processor 32 bus TA/ line. 
TEA L - NBUS 21 TRANSFER ERROR ACKNOWLEDGE LINE; 1 PINS; I/O; 

Port processor 32 bus TEA/ line. 
35 CS L - NBUS 21 SLAVE CHIP SELECT; 1 PINS; I; 

Slave register access chip select input. 
XCS L - NBUS 21 XBUF 102 SLAVE CHIP SELECT; 1 PINS; I; 

Chip select for XBUF 102 access. 
MEMWR L - BUFFER MEMORY READ/WRITE LINE; 1 PIN; OUT; 
40 The FX chip 54 buffers the port processor 32 buses' Read/Write line to buffer memory. This allows the 
FX chip 54 to write buffer memory on the same transfer as an XBUF 102 location is read. 
ARBWIN<2:0> H - NBUS 21 MASTER PORT; 3 PINS; I/O; 

The current winner of arbitration for NBUS 21 master. 
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XBUF 102 PORT INTERFACE SIGNAL PINS 

The following device signal pins attach to the XBUF 102 fast static rarn devices. 

Table 47 



w 



15 



20 



25 



30 



35 



XBUF 102 PORT SIGNAL PINS 


Signal 


Type 


#pins 


total 


Description 


XD<31 :00>H 
XDP<3:0> H 
XADDR(18:02> H 
XADDR_JNV<18:17> H 
XWE L 
XOE L 


IO 
IO 

0 
0 
0 


32 
4 

17 

2 

1 


82 

86 

103 

105 

106 


XBUF Data Bus 
XBUF Data Parity 
XBUF Address 
XBUF Address Inverted 
XBUF Write Enable 


0 


1 


107 


XBUF Output Enable 



XD<31:0> - XBUF 102 DATA LINES; 32 PINS; I/O; 

These lines tie to the XBUF 102 RAM's data pins. 
XDP<3:0> - XBUF 102 DATA PARITY LINES; 4 PINS; I/O; 

These lines tie to the XBUF 102 RAM's parity data pins. Odd byte parity is stored with all XBUF 102 
data Odd byte parity is checked on all XBUF 102 read operations. 
XADDR<18:2> - XBUF 102 ADDRESS LINES; 17 PINS; OUT; 

XPAGE is divided into 512 byte pages. These lines drive the address value to the XBUF 102 RAM 
chips. For 32K x 9 devices, only XADDR<16:2> pins may be used or 128K x 9 devices, XADDR(18:7> pins 
can be provided. 

XADDR INV<18:17> - XBUF 102 ADDRESS INVERTED LINES; 2 PINS; OUT; 

The lines are provided for expansion to extra 32K x 9 devices instead of 128K x 9 f s. 
XWE - XBUF 102 WRITE ENABLE; 1 PINS; OUT; 

This line drive the XBUF 102 RAM's Write Enable (WE_L) line. 
XOE<1:0) - XBUF 102 OUTPUT ENABLE LINES; 2 PINS; OUT; 

The XOE line drive the XBUF 102 RAM's Output enable (OE_L) line. 
Using 32K x 9 SRAM devices the FX chip 54 can drive 2 or 4 sub-banks; 128Kbyte (256 Pages) or 
265Kbytes (512 Pages) of XBUF 102. Using 128K x 9 SRAM devices the FX chip 54 can dr.ve up to 
1 Mbyte or 2048 pages of XBUF 102. 

PORT PROCESSOR 28 INTERFACE SIGNAL PINS 



Table 48 



PORT PROCESSOR 28 INTERFACE SIGNAL PINS 



Signal 


Type 


#pins 


total 


Description 


ERR L 


O 




108 


ERROR Summary output 


IRQ L 


O 




109 


Interrupt request to processor 28 


RESET L 


I 




110 


Master Chip reset 


TESTMODE H 


I 




111 


Used for internal testing purposes 


IVOUT H 


O 




112 


Used for internal testing purposes 
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Table 49 



JTAG SIGNAL PINS 


Signal 


Type 


#pins 


total 


Description 


TDO H 


0 


1 


113 


Test Data Output 


TDI H 


1 


1 


114 


Test Data Input 


TCK H 


1 


1 


115 


Test Clock 


TMS H 


1 


1 


116 


Test Mode Signal 



NBUS 21 PARITY ERRORS 

J5 The NBUS 21 has ODD Byte data parity. The FX CHIP 54 checks data parity on all NBUS 21 data 
transfers. 

XBUF 102 PARITY ERRORS 

20 The XBUF 102 port does check parity of outgoing XBUF 102 write data. A parity error will be flagged in 

the CSR if bad parity is written. This parity check can be disabled by setting the NBUS 21 Parity Check 
Disable bit in the CSR. 

During XOR type operations, the FX chip 54 must calculate XBUF 102 byte parity. The XBUF 102 port 
computes ODD byte parity on all XOR type operations. 
25 Parity is checked on all XBUF 102 read operations. This is required since parity is recalculated on all 
XOR type operations. 

INTERNAL PARITY 

30 On XBUF 102 read operations parity is passed through the FX chip 54. If there is a parity error on read 
operations, a bit will be flagged in the CSR, and the data with the bad parity will go out on the NBUS 21. 

While there have been described above, the principles of the present invention in conjunction with 
specific apparatus, it is to be clearly understood that the foregoing description is made only by way of 
example and not as a limitation to the scope of the invention. 

35 

Claims 

1. A computer mass storage device controller system comprising: 

first and second controller modules operatively coupled to a plurality of computer mass storage 
40 devices, said first controller module controlling a first subset of said plurality of computer mass storage 
devices and said second controller module controlling a second subset of said plurality of computer 
mass storage devices; 

a first signal line communicating between said first and second controller modules to indicate the 
presence of said first controller module to said second controller module and said second controller 
45 module to said first controller module; 

a communications channel interconnecting said first and second controller modules, said first and 
second controller modules being responsive to a controller present signal on said first signal line to 
enable communications between said first and second controller modules over said communications 
channel; and 

so a second signal line coupling said first and second controller modules, means enabling said first 

controller module to functionally disable said second controller module in response to an indication on 
said communications channel that said second controller module may have malfunctioned, including 
means for said first controller to unlock and assume control of said second subset of said plurality of 
computer mass storage devices, said controller system further comprising means to prevent each 

55 controller module when properly functioning from disturbing contents of mass storage devices cor- 

responding to the other controller module when properly functioning. 
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2. The computer mass storage device controller system of claim 1 further comprising: 

first and second data cache modules operatively coupled to said first and second controller 
modules by respective first data paths, said first and second data cache modules further having 
alternative second data paths for operatively coupling said first and second controller modules to said 
second and first data cache modules respectively, wherein said first controller module is further 
operatively coupled to said second data cache module by said second data path upon assuming 
control of said second subset of said plurality of computer mass storage devices. 

3. The computer mass storage device controller system of claim 1 wherein said first and second signal 
lines and said communications channel comprise a backplane into which said first and second 
controller modules are electrically coupled, wherein said communications channel, comprises an 
asynchronous communications channel, wherein said asynchronous communications channel com- 
prises first and second unidirectional communications lines, said computer mass storage device 
controller system further comprising first and second clear-to-send signal lines. 



75 



A fault tolerant computer mass storage device controller comprising: 

first and second controller modules for operative coupling to a plurality of computer mass storage 
devices, said first controller module for controlling a first subset of said plurality of computer mass 
storage devices and said second controller module for controlling a second subset of said plurality of 
20 computer mass storage devices; and 

a plurality of communications lines interconnecting said first and second controller modules; said 
communications lines for providing configuration and operational information between said first and 
second controller modules, said communications lines further including a controller module disable line 
for assertion by either of said controller modules to operatively disable an other of said controller 
25 modules in response to a malfunction indication from said other of said controller modules; 

means for enabling said first controller module operatively to disable a malfunctioning second 
controller module by asserting said controller module disable line, and means for said first controller 
module additionally assuming access to and control of said second subset of said plurality of computer 
mass storage devices. 



30 



The fault tolerant computer mass storage device controller of claim 4 further comprising: 

first and second data cache modules operatively coupled to said first and second controller 
modules by respective first data paths, said first and second data cache modules further having 
alternative second data paths for operatively coupling said first and second controller modules to said 
35 second and first data cache modules respectively, 

wherein said first controller module is further operatively coupled to said second data cache 
module by said second data path upon assuming control of said second subset of said plurality of 
computer mass storage devices. 

40 6. A computer mass storage device controller system comprising: 

first and second controller modules for operative coupling to a plurality of computer mass storage 
devices, said first controller module for controlling a first subset of said plurality of computer mass 
storage devices and said second controller module for controlling a second subset of said plurality of 
computer mass storage devices; and 

45 a communications medium interconnecting said first and second controller modules, said commu- 

nications medium for conveying an indication signal showing a controller presence, from one of said 
controller modules to the other of said controller modules when said one of said controller modules is 
included in said system, said communications medium including a plurality of controller module 
communications lines for communications between said first and second controller modules when said 

50 controller present indication is detected by both said first and second controller modules, said controller 
system including means to send a message to said second controller that said first controller is booting 
and means to disable a second malfunctioning controller as well as means to unlock and gain access to 
and unlock said second subset of said plurality of computer mass storage devices. 

55 7. The computer mass storage device controller system of claim 6 further comprising: 

first and second data cache modules operatively coupled to said first and second controller 
modules by respective first data paths, said first and second data cache modules further having 
alternative second data paths for operatively coupling said first and second controller modules to said 

50 
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second and first data cache modules respectively, wherein said first controller module is further 
operatively coupled to said second data cache module by said second data path upon assuming 
control of said second subset of said plurality of computer mass storage devices. 

8. The computer mass storage device controller system of claim 6 wherein said plurality of communica- 
tions medium comprises a backplane into which said first and second controller modules are 
electrically coupled, 

wherein communications between said first and second controller modules on said communications 
medium are conducted asynchronously. 

9. The computer mass storage device controller system of claim 8 wherein said communications medium 
further comprises first and second clear-to-send signal lines. 

10. The computer mass storage device controller system of claim 6 wherein said other of said controller 
modules is precluded from functionally disabling said one of said controller modules once said one of 
said controller modules has disabled said other of said controller modules. 



51 



EP 0 632 379 A2 




BNSDOCID: <EP 0632379A2J_> 



52 



EP 0 632 379 A2 



^CONTROLLER MODULE A 



I 



88A 



o 
cr 



o 



88B- 



DRAB 

( MASTER) 



38A 



7 



BUS 

EXCHANGER 
A 



CACHE MODULE A 




(CACHE MODULE B 




lQ-2 



108^^ 

/- - 

CONTROLLER MODULE B 



DRAB 

( MASTER) 




IIOB-^ 



38 B y tOO 3 _ 



BNSDOCID: <EP 0632379A2_I_> 



53 



EP 0 632 379 A2 



//OA 



CACHE A 
MODULE A 

B 



/OA 



1 



CONTROLLER MODULE A 
1 9 3 4 5 



120, 




*22 



HOB 



1 



I20 2 



120, 



CACHE 
MODULE B 



B 



/20 5 



°J2 



i 



6Z- 



720ol 
c — ~i. 



i 6 , 2 ' 

/ 

r 



62 2 
I 

/ 



\62j 



a 



€2 A 



120. 



1 




6?j 



/2Q 





O 


1 2 3 4 


5 




CONTROLLER MODULE B 





Fig-3 



/OB- 



54 

0632379A2 l_> 



EP 0 632 379 A2 




BNSDOCID: <EP 0632379A2J_> 



55 



EP 0 632 379 A2 




8 

CM 



A 
O 

V 

o 
u 

A 

O 
V 
Ui 

o 

£ 

X 



V 

a 



A 

v 

O 
<t 
Q_ 

X 



A 

V 
Q 



A 
CM 
V 

i 

X 



A 
ro 

V 

a 



55 



A 


A 


A 


<r 


m 


CD 


V 


V 


V 


a 


a 


a 




2 




a 


o 


o 


A 


A 


A 


<r 


m 


CO 


V 


V 


V 


LU 


LU 


LU 


O 


a 


§ 














X 


X 


X 



A 
V 

o 



A 

v 

LU 

< 

CL 
X 



CL 

o 
u 



O 



Lr 



cr 
a 
o 
< 



< 
a 



cr 

a 
< 

LU 

< 
a 



LU 

s 



5 




I 



CL 



BNSDOCID: <EP 0632379A2J_> 



56 



EP 0 632 379 A2 



to 



o 

LU 
> 

<r 

LU 
CO 
UJ 

cr 



to 



cvj 



o 



£3 




CO 



cn £ 

LU — 

a: u 



a 

> 
cr 

UJ 
V) 
LU 

cr 



rO 
CM 



cr 

UJ 



cnv cn 



CVJ 



A 
O 

to 

V 

^ CO 
CD S 



o 

UJ 

> 

UJ 

cn 

UJ 

cr 




2d 



§1 



<d < < 




cn cnO 



CVJ X* 
cn to O 



5 

I 



CP 



a 
to 



ro 



> 
cr 

UJ 

cn 



ro 

CVJ 



c\j 



to 
cvj 

id 

CVJ 



fO 



£ LU 



» cr 



o 

a", 



I 



in 



it, to" 



a. 

cr 
!_ a: 

_ CL A 

cn O- 
cn^ v 



CVJ 



cn 



m o 



COLO O 



uJn a 

CO LU 

rvj mpg 



°- ro 
tn v 

?§ 



cn 

XLU 

Lucr 



co»- LU 
oc: cn 
xouj 
. clct 



CM 



lO 
OJ 

CD 
CVJ 

CVJ 



CD 
CVJ 



CD 
CVJ 



c> •* cn 

COiDUJ 

i v cr 



o 

fO 



fO 



in 



CD 



f 



CD CO O 



cntn O 



?i2 



!fi= Sir 



*«*— 777 ^fe 

O ^ ce 



SB ^ 

8§§ 



i— > cn 



eg 



CD 



05 



O 



U 
LU 

cn 



cr 

a. 



x 

■ u — 

5? 



> 

cr 

UJ 
CO 
UJ 

cr 



cr 

LU f- 

i = 

■= — a- 

€ lu 

i 5 

a — 



2 
10 



cnrr | 



O 
LU 
> 

cr 

UJ 

CO 



BNSDOCID: <EP 0632379A2_I_> 



57 



THIS PAGE BLANK (USPTO) 



(19) 



Europaisches Patentamt 
European Patent Office 
Office europeen des brevets 



(12) 



(id EP 0 632 379 A3 

EUROPEAN PATENT APPLICATION 



(88) Date of publication A3: 

10.10.2001 Bulletin 2001/41 

(43) Date of publication A2: 

04.01.1995 Bulletin 1995/01 

(21) Application number: 94108649.8 

(22) Date of filing: 06.06.1994 



(51) mtci7: G06F 11/16, G06F 1 1/20 



(84) 


Designated Contracting States: 


• Brame, Richard G. 




DE FR GB IT 


Monument, Colorado 801 32 (US) 






• Mclean, Ronald H. 


(30) 


Priority: 04.06.1993 US 71710 


Elbert, Colorado 80106 (US) 






• Walker, Michael D. 


(71) 


Applicant: Compaq Computer Corporation 


Colorado Springs, Colorado 80903 (US) 




Houston Texas 77070 (US) 


♦ Fava, Thomas F. 






Colorado Springs, Colorado 80919 (US) 


(72) 


inventors: 


• Adams, Thomas Gene 


• 


Sicola, Stephen J. 


Peyton, Colorado 80831 (US) 




Monument, Colorado 80132 (US) 




• 


Jackson, James P. 


(74) Representative: Charig, Raymond Julian 




Colorado Springs, Colorado 80918 (US) 


Eric Potter Clarkson, 


• 


Oakey, Roger L. 


Park View House, 




Colorado Springs, Colorado 80908-3505 (US) 


58 The Ropewalk 






Nottingham NG1 5DD (GB) 



CO 

< 

CO 

CM 
CO 
CO 

o 

CL 

LU 



(54) Fault tolerant storage controller utilizing tightly coupled dual controller modules 



(57) A fault tolerant storage controller utilizing tightly 
coupled dual controller modules. The controller mod- 
ules each check to see if another controller module or 
cache module is present and, if so, then all configuration 
information with respect to the controller modules and 
attached devices are shared between them. Configura- 
tion information maybe entered into either or both of the 
controller modules and the information is shared dy- 
namically. Each cache module may be "locked" by an 
individual controller module to prevent the other control- 
ler module from inadvertently disturbing the contents of 
the other controller module's cache. During initialization, 
each controller module checks for the existence of an 
associated cache module and, if present, it is immedi- 
ately "locked" by the controller module. Should a con- 
troller module fail or give an indication of a malfunction, 
the other controller module will disable or "kill" the mal- 
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erations where the malfunctioning controller module left 
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controller module's storage devices preventing the loss 
of any host computer data. The controller modules are 
tolerant of the other controller module failing and then 
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recognized by the surviving controller module such that 
it does not disable the one that failed. The dual controller 
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